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The present invention relate to novel genes from Enteroco, 
vectors, host ceils, antibodies and methods for producing the same 
Enterococcus nucleic acids, polypr.;-,;ides and antibodies in a biologic 
the prevention or attenuation of innvnon by Entemcoccus 



the polypeptides they encode. Also provided are 
Nnonally relates to diagnostic methods for detecting 
;esent invention further relates to novel vaccines for 
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' "'wwcci^cwl^^ : and polypeptide 



Field of the i nvention 

The present invention relates to . / 
5 nucleic acids and polypeptides. Also p:v\ adec 
methods for producing the same. Furl} /: :tcv 
detecting Enu rococcus faecalis using p- • : 
nucleic acids and polypeptides of the pat—a i 
to screening methods for identifying ac , ^ ;! 
10 polypeptide activity and 10 vaccines us.:. a a 



• ; >ccus faecalis genes {E.faecalis) 
ne \ ectors, host cells and recombinant 
:;et are diagnostic methods for 
a:.a: s r and antibodies to the E.faecalis 
notion. The invention further relates 
: antagonists of E. faecalis 

av nucleic acids and polypeptides. 
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Background of the Invention 

Enterococci have been recognize 
turn of the century when they were firs; 
15 microscopic organisms. The genus Ente 
faecalis or E. faecalis which is the mosi 
80 - 90 percent of all cnlcrococcal infeca 
Microbiol Infect Dis.9:l 11-117. 

The incidence of enterococcal un 
enterococci are now the second most frc 
Enterococca: -infection is of particular c 
Recent atten a on has focused on enteroca 
nosocomial infections, but also because 
antimicrobial agents. These factors are a 
25 enterococci to survive in an environment 
used; the hospital setting provides the an 
susceptible bacteria, thereby providing a 
and the hospital also provides the potent 
via the usual routes of hand and enviro 



actons 



cing pathogenic for humans since the 
aned ay Thiercelin in 198S as 
: a, eludes the species nnterococcus 
; n a tiogcn in the group, accounting for 
V< : a .Mset al. (1990) Eta J. Clin 

at; increased in recent years and 
eported nosocomial pathogens, 
ea ase of its resistance to antibiotics, 
a because of their increasing role in 
aakable and increasing resistance to 
a a forcing since resistance allows 
antimicrobial agents are heavily 
a- nich eliminate or suppress 
advantage for resistant organisms, 
semination of resistant enterococci 
nomination. 
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Ann: wobial res: -^ance can be v 
mtnnsie pro- . e and th; which is acqi: 
other specie- wractens; :cs, appear to is 
resistance a- . from or ;er a mutation 
DNA. The \ ;.: sous inherent traits exprc- 
semisynthet.; penicillina.^-resistant pen 
aminoglycosides, and lov. levels of clinri. 
include resis- .ee to chloramphenicol, ei 
tetracycline. high levels c : aminoglycosn: 
nuoroquino: sees, and vancomycin. Res; 
penicillinase ...,a resistance to Hiioroquu 
transposon mediated and presumably ai\ 
Although the man; reservoir for e 
tract, the bac.cria can alsc reside in the g; : 

E.jci^.iUs has emerged as an inn 
urinary tract Sections (I'TJs), intraabdc 
neonatal sepsis. See Lewis et al. (1990') 
became firmv established as major nose , 
leading cause of hospital-acquired infect:, : 
the United Slates. Fatality ratios for enh : 
with death due to enterocaccal sepsis m - 
(1993) Clin. Microbiol. Rev. 6:428-442. 

The ahlity of enierococci to cole: 
intrinsic and squired resistance traits, m 
seem to have : datively lew intrinsic vinne: 
become secondary invaders. Since nosoe. 
resistance to essentially every useful ant:: 
increasingly difficult to successfully trea 



■ o genera] tvnes 
- '-^ tor intrinsic i e 
'-romosome. Ac. 
^ T DNA or acquis:' 
■cocci include res: 
oosporins. low k - 
nples of acquired : 
■ngh levels of clirn: 
by means of pen:; 
■ levels of penicil : 
'■-t known to be pi;- ^ 
' -tion(s). 

e humans is the ga* 

ethra and vagina. 

een in endocarditis 
" -ons, soft tissue in: 

oe 1970s and 1980 
*'gens. They are in j 

n d leading cause o: 
^ icrmia range from 
■hese cases. See T s 



: :erent or 
-tancc, like 
red 
'wn of new 
e lance to 
-Is of 
csistance 
iamycin, 
dlinase, 
a without 
:nid or 

:omtestinal 



bacteremia, 
ections, and 
enterococci 
v the fourth 
bacteremia in 
! 2% to 68%, 
i. Emori 



'^intestinal tract, plus the many 
se organisms, which usually 
en an excellent opportunity to 
-■s of enterococci have displayed 
-nt, it will ILkely become 
cnterococcal infections. 
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Particularly v. aon the varums resistance < -ether in a single strain, an 

event almost certain to occur at some tin.. a, a e:eae. 

The etiology of diseases mediated r . o r.ed by Enlerococcus faecalis, 

involves the pi -..grammed expression of E , , ; ■ a -es, and that characterizing these 

5 genes and their patterns o: expression we .. .a a.oe .aally add to our understanding of 

the organism nod its host interactions. Kr. ■ <■: ihe E. faecalis gene and genomic 

organization would improve our understa ; ; . a -die-case etiology and lead to 

improved and new ways of preventing, t: a . a agnosing diseases. Thus, there 

is a need to characterize the genome of £ a for polynucleotides of this 
10 organism. 

Summary of ihe Invention 

The present invention provides fo: a or; . . faecalis polynucleotides and 
polypeptides shown in Table 1 and SEQ d \o o :g h SEQ ID NOo 96 

15 (polynucleoiioe sequences having odd SL id) d-d;, and polypeptide sequences 

having even SEQ IDNOsa One aspect co o.e o. or. ion provides isolated nucleic acid 
molecules comprising polynucleotides ha r e a a rd oicie sequence selected from the 
group consisting of: (a) a nucleotide scqeeioe shew;- m Table 1; (b) a nucleotide 
sequence encoding any of the amino acid scaacures of the polypeptides shown in 

20 Table 1; and (c ) a nucleotide sequence com: lememary to any of the nucleotide 

sequences in on or (b). The invention furea : e,vdo for fragments of ihe nucleic 
acid molecules of (a), (b) co (c) above, 

Furtlier embodiments of the invent a ao isolated nucleic acid molecules 
that comprise a polynucleotide having a r.a, ;oaoac sequence at least 90 c /,'» identical, 

25 and more preferably at least 95%, 96%, r . 0y ■ 09% identical, to any of the 
nucleotide sequences in (a), (b) or (c) abc a ; ; ao nucleotide which hybridizes 
under stringea; hybridization conditions \ . • a eotide in (a), (b) or (c) above. 
Additional nueieic acid embodiments of t:o aercaaoa relate to isolated nucleic acid 
molecules comprising polynucleotides who, ru. ue the amino acid sequences of 
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ention fi:: 
;ence selc 
peptides , 



epuope-be;-;:-- c portions 
t a ) above. 

The p: c<cnt inve: 
isolated iuu . : . acid me. 
5 the recomb;;;.. r. vectors. 
The present ::. . ention fe 
E. faccalis po ; peptides 

The ;: 
amino acid se 
K) any of tiic po 

The p -.peptide 
an amino acid sequence ■ 
80%, 85%. 9f- „, 95%, 9 
1, as well as p. i\ peptide 
15 preferably ai .east 75% i, 
96%, 97%, „ 0 r99% 
molecules encoding suet. 

The present inver 
comprising one or more . 
20 in Table ] , or fragments \ 
carrier, or excopient, whe 
effective to ei:c;i an imnr. 
least E. facci;. : , in an an 
may further be combined 
25 Enterococcal or non-Entc 
intended to eiiei; an lmrm 
genus and, optionally, or,. 

The \ ..c.mes of ti. 
"naked" DNA. wherein i: 



■ a A'. lavcalis 

'Mn also relate 
. :des of the pi 
• s well as to m 
-tier relates to 
r peptides by ■ 
eier provides is 
led from the l 7 : 
escribed in Ta:~ 
of the present 
r.h at least 70 : 
%, 97%, 98%. 
having an ami; 
entical, and stn 
dentical to tluv 
polypeptides. 



lurthcr pi\ 



'f the E. faecal; 
Hereof, togethe: 
em the E. face. 
me response to 
imal. The £._;'.■ 
with one or me: 
ococcal organ i: 
uological respi 
or more non-J:. 
e present inver. 
e DNA encode 



■ inc an amino :d sequence in 



. ot vectors, whici 
. and to host ceo 

::>g such vectors : 
: vectors in the ; 

echniques. 



o_ o/% polypeptides 
e of an ammo ace. 
. .eats thereof, 
include polypcp 
r.d more preferab' 
■ r.rity to those des, 
-o:;ce at least 70% i, 
: eoahly 80%, 85%. ^ 
■■• ell as isolated mu 



oiclude the 
containing 

■ '.d host cells. 
: oduction of 

.aving an 
sequence of 

:des having 

■ at least 75%, 
;bed in Table 
entical, more 
:■%, 95%, 

eic acid 



"... .c or multi-compor.ont vaccine 

e 'odes or polypeptides described 
e: : -aceutically acceptable diluent, 
e: o;lc(s) arc present i i an amount 

e: :he Enterococcus genus, or at 
Jy;- jptides of the present invention 
oeens of one or more other 

o : :e a multi-component vaccine 
> lembers of the Enwrococcus 

■ organisms, 
o '.ministered in a DNA form, e.g., 
r . :e Enterococcal polypeptides 
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and, optionally, one or mere polypeptides 
encoding one o;- more polypeptides may io 
are expressed ;:s fusion proteins. 

The var sines of the present inverse. 
5 component of;; geneticaliv engineered oil: 
engineered organism or heist cell which ex: 
may be administered to an animal. For ex.;.. 
organism or host cell may contain one or so 
invention intrac cllularly, on its cell surfac. 
10 a genetically engineered organism or host ee 
polypeptides. The vaccines of the presen; : 
an animal with an immune system modular 

The invention also provides a metis 
in an animal to one or more members of live . 
15 more isolates of the E. faecalis species, con. 
vaccine as described above. 

The invention further provides a nee 
response in an animal, sufficient to prever.; 
members of the Enterocolitis genus, pre fere. 
20 comprising administering 10 the animal a ee, 
polynucleotides or polypeptides describes e 
these polypeptides, or fragments thereof ;v 
and/or administered in admixture with an ;ss 

The invention further relates to aim; 
25 administration of one or more E. faecalis re 
methods for producing such antibodies arc : 
relates to recombinant antibodies and fraem. 
such antibodies and fragments thereof 

The invention also provides diagivm 



...erococcal organism. The DNA 
eh such that these polypeptides 



af . he administered a 
is a cell. Thus, a ger. 
a;e or more E.faecali: 
. o genetically engm< 
so oo7/5 polypeptides r 
s : ■ s aplasmic space, i 
e - ete one or more E. 
ni saay also be co-achv 
m -X6 andGM-CSF) 
iseoig an immunologic 
a-: nts genus, prefcrah 
: ;s. ministering to the : 



.tically 
polypeptides 

ered 

f the present 
urther, such 
aecalis 
mistered to 

al response 
iy one or 
limal a 



a: seeing a protective immune 
see. or control an infection by 
less; E. faecalis species, 
■ OS: eomprising one or more of the 
a ! or fragments thereof. Further, 

s o gated to another immunogen 

ehested in an animal b\ the 
o sfe: of the present invention and to 
sse thereof. The invention further 
er ; and to methods l or producing 



for detecting the :xpression of 
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the polym.eie 


odes of T 


v :e i by merr/v 


. i 'occccus genus 


. . an animal. 


One such me - 


- -'Vol-. 


~ - assaying for 


:t a pol>nuoieo: 


c encoding 


E.juccu!::' p 


_ peptide 


:i a sample in 


This expression 


cl>* be assayed 


either direct!; 


..i;. by , 


saying polype; 


eg antibodies ei: 


. led in 


response to an 


::e acid :■ 


auenees descr: ; 


or indirectly t i 


. , by assaying 


for antibecie 


o/i ng sp: 


: in city for amo 


;es described in ' 


1 .bie 1). The 


expression of 


" dynuclcv 


.ides can also b . 


.electing the nuc. 


. e acids of 


Table 1. An ■ 


ompie of 


•oich a method r 


oo ose of the polvmcr 


. e chain 


reaction (PCI 


:o ampii ; 


and detect En: 


o : ,eic acid sequcm 


:s. 


Tr.cp: 


. o enl inve 


.lion also relate^ 


■ ;d probes having 


d or pan of a 


nucleotide se 


oence des. 


"bed in Table i 


i i NOs) which are 


- apable of 


hybridizing u: 


der string; 


■:u conditions to . 


nucleic acids. T 


e invention 


further relate- 


"o a method of detecting on. : o. 


- . :crococcus nuclei 


o acids in a 


biological saio 


e obtain l 


d from an ammo.. .: 


■ 'no re nucleic acid 


encoding 


Enterococcu:- 


olypeptu 


cs. comprising: 


"" '.0 the sample wit!: 


one or more 


of the above-. 


escribed :\ 


ocleic acid prob 


- o. iitions such that ! 


bridization 


occurs, and ('•■ 


detecting 


hybridization o . ■ . 


: -.ore probes to th. 


Enterococcus 


nucleic acid p ; 


o sent in th 


e biological sam;: . 






Other 


:ses of the polypeptides of' 


c ;* .vention include: 


inter alia^ to 


detect E. faea. 


is in mir. 


unoassays, as epo ::c 


.:■ molecular weigh 


. markers on 


SDS-PAGEf, 


■Is. as mok 


cular weight mar- 


■: - cular sieve gel Til 


ration 


columns, to g- 


aerate anybodies that spec: :.i\ 


":::o E. faecalis polype; 


tides of the 


present mvern 


on for the 


detection E. fac. . . 


oioassavs. to gene 


ate an 


immune resp^ 


:se agains: E.faecalis and . ■ o f 


"ecus species, am 


as vaccines 


against E. fa a 


is, other 


Enter oca ecus spo . , 


ooi . ,:er bacteria genus e 




Isolate 


;: nucleic a 


cid molecules of .. : r- 


c c:" ::i vention, parti cu: 


;riy DNA 


molecules, arc 


useful as ; 


*robes for gene io, : ; :, 


o . or identifying E. /. 


:ccalis m a 


biological sar. 


pies, for u 


stance, by Soul. :.: 


mem blot analyse 




Polynucleotic 


of the p: 


esent invention . 


. in detecting E. : 


ccalis by 
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PCR using pria.ers for a i articular E. face: : . ; . eotide. Isolated 
polynucleotide - of the present invention : in making the polypeptides of 

the present in\ en lion. 



Detailed Description 

The present invention relates to rcc . 
fragments thereof. The present invention a 
polypeptides raid fragments thereof. The :: 
these polypeptides to produce immunologa 
protection to disease caused by members e 
of the E,faecais genus. The invention fur 
encode antigen :e E.faecalis polypeptides a 
nucleic acids and polypeptides in biologica 
antibodies specific for the polypeptides an 
methods for defecting such antibodies proc: 



-.!,; ; faecalis nucleic acids and 
Me; rcanes to recombinant E. faecalis 
eiv.ic;; . > relates to methods for using 

lesee: : cs and to confer immunological 
e . .;. Enterococcus, at least isolates 
: el .;e re nucleic acid seqtiences which 
: to i- .ciarls for detecting E.faecalis 
■.amp.es. The invention also relates to 
aeneac< , f the present invention and 
ed m a east animal. 



Definitions 

The fol'. owing definitions are prova .a: aa rear a/ the subject matter which the 
inventors consider to be i he present invema ;r 
20 As usee herein, the phrase "pathogeny aaca:" means an agent which causes a 

disease state or affliction m an animal. Inc. .are. v, alaa this definition, for examples, 
are bacteria, protozoans, fungi, viruses and a.etar aarasites which either produce a 
disease state or render an animal infected a a rue a . organism susceptible to a 
disease state (t ..sa-, a secondary infection). a dae a.„ aided are species and strains of 
25 the genus Enta ococcus which produce disc e siarr :n animals. 

As used herein, the term "organism' near ; arc living biological system, 
including virus es, regardl ess of whether it a : ! a:a a c::c agent. 

As usee herein, the tenn "Enteroa . us" means any species or strain of 
bacteria which is members of the genus Err. \ . Such species and strains are 
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known to t.ac 


;' >k0; : 


me art. a no :m 


arc not. 






As us. 


. 


:e phrase "one s 






polypeptides . 


5 more of the I. 


.u:ijji:s s 


lypepudes desc 


t w 1 e y v r 1 ^ J 


. v ' ■> , v . 


" res sec as His ion 


polypeptides 


: 'he pres 


oot invention are 


which ma}' be 


f Enters 


occal or non-En; 


includes polyp 


eptide ce: 


uprising fragmeo 


:o present mver... 


em Ado; 


;oi;al definition:: 


Exulanatioji 

JT 


r Table J 




Table 


. below, p 


sovides informal : 


polypeptides ■ 


E. face ^ 


is. The table lis: 


15 letters EF. wh: 


sh denote 


::. fa ec a I is, folic. 


code, which :o 


■Mtranly r. 


amber the E. fac\ 


number from 


through - 


: follows the thro 


Full length, ope 


. readme 


rame of the gene 


number. A n.. 


mbcr2 re 


^resents the full 1. 


20 specified tne e 


ooccedmg 


three digit numb'. 


fragment, of is 


e gene re] 


oesented by the p 


produce an an 


i genie po- 


ypeptide. A nun 


fragment , 01 : . 


ne gene represented by the 


stimulate an i ^ 


online res 


~onse or as a vac, 


25 sequences of e 


.eeh gene : 


nd fragment arc a 


SEQ ID NO , 


-ted in Ta 


oe 1 . 


Explanation, r 


Table 2 





Table : lists accession numbers fc: 



: are pathogeny and those that 

o//o poiypeptic.es of the 
-lino aco; seqoence of one or 
■ 1 i.even SEQ ^ NOs). These 
.so: . . : em the £. o;ec oo 
■ eo ' ::t;onal ammo acid sequences 
- s. :m. This phrase further 

.:ecalis polypeptides of the 
so throughout the specification. 

-• • • oo genes which encode 
oc ;jo.; osentifier which consists of the 
. oo. , . oo. oly by a three digit numeric 
ceo. " :"the present invention. A 
p " o;o A number 1 represents the 
oooo • the preceeding to.ree digit 
o o 'ypeptide encoded by the gene 
A :ooo'oo- 3 represents a polynucleotide 
ec ooe three digit number, used to 
o: -> o o oo: -ents an antigenic polypeptide 
oooo. oo tnree digit number, used to 
. "To o ,b;eotide and amino acid 
sio o; the Sequence Listing under the 



matching sequences between 
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the polypeptide; of the present invention ;:: 
Dcrwent databases. These reference numb 
commonly use. by those of skill in Ihc an. 
denominations The desenptions of the mi: 
the National Center for Biotechnology Info- 
of the present invention. Column 2 lists tlu 
sequence in GcnBank or Derwent database* 
"match" gene sequence. Columns 4 and 5 :! 
probability, respectively, calculated by Bl 
invention that .o not share significant ider 
sequences of CienBank and Derwent are m 
the present invemion that share significant 
the polypeptides of GcnBank and Derweni .: 



'.no- 
are ; 
no \v; 
•elan 
..alio:, 
ces^ . 
Cohn 
■ the in 
ST. r 

•' sir:.: 
eprcsc 
eruitn ■ 
e rcp;\ 



liable through GenBank and 
^ database entry numbers 

farnilar with their 
- ior GcnBank arc available from 
■lumn 1 lists the gene or ORF 
number of a "match" gene 
■ - lists the description of the 
-'h score and smallest sum 
^peptides of the present 
•• ; !:v with any polypeptide 
:: - in Table 2. Polypeptides of 
clarity with more than one of 
sen ted more than once. 



15 Explanation of Table 3. 

The E. faecali-. polypeptides of the prcscn: 
conservative annuo acid substitutions froir. 
as indicated in Table 3. Changes are prefer: 
amino acid substitutions that do not signify 

20 protein. Residues from the following group 
substituted for one another: Aromatic, Hydr 



.ent;n 
mural ; 

y of. 
ndy ; 

as ind 
nhobn- 



'> "iay include one or more 
'.stations or human manipulation 
i-M-nor nature, such as conservative 
mn the folding or activity of the 
en led in Table 3, may be 
■ p oiar, Basic, Acidic, and Small, 



Explanation of Table 4 

Table < lists residues comprising an nenic e: 
25 bearing fragments prcscn; in each of the fu; engtl: / 
in Table 1 as predicted by the inventors usn n the a- 
(1988) Comp. Appl. Biosei. 4:181-186. Tin ame 
performed usine the computer program PR; PEA- . 
Macintosh. D \ ASTAR, inc., 1228 South 1 ,k St, p 



^;opes of antigenic epitope- 
:<;ecalis polypeptides described 
nthm of Jameson and Wolf, 
- vVolf antigenic analysis was 
'^■rsion 3.1 1 for the Power 
Madison, WJ). E.faecalis 
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cs Tor each full 1. 
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. .uence shown in Table 1 and in 


the Sequence . <\uml. v> 
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■resent ic. 


::ition that do not have 


antigenic erit cs reccec. 


eed by the Jamcc 


n-WolJ . 


^ ^nthm are not represented in 


Table 2. 









10 Selection p/ A:, .Wc Acui . ccjuences Encod. ? Aniigv : E.faecalis Polypeptides 

Scquce, -a! E/.V. ..•/£* genomic DN was oh.:: ::ed from the E.pccalis strain 
V586. The E. :;ecaiis strain V586 was deposited 2 iVir.y 1997 at the ATCC, 10801 
University Bh a. Manasr,,s. VA 201 10-22'. >, and give;, accession number 55969. 
Some «. :RFs conusance! in the subset r>f fragmer.-.s of the E.faecaUs genome 

15 disclosed here, were deed through the i. e of a n,, er of screening criteria detailed 
below. The OK Fs are bo . .ruled at the amir., terminus -v a methionine or valine 
residue and usually at the carboxy termmu. by a step codon. 

Most the scie, :ed sequences consist ofc-aariete ORFs. The polypeptides 
that do not comprise a complete ORF can b> determined by determining whether the 

20 corresponding polynucleotide sequence comprises a Si; >p codon after the codon for 
the last ammo acid rcsid,,c in the polypeptide sequence. It is not always preferred to 
express a complete ORF n a heterologous vstem. It stay be challenging to express 
and purify a highly hydi. -phobic protein b;. common ..:b oratory methods. Some of 
the polypeptide vaccine candidates described herein have been modified slightly to 

15 simplify the production n' recombinant protein. Fo: example, nucleotide sequences 
which encode oaghly hydrophobic domain: such as loose found at the amino terminal 
signal sequenc e, have been excluded from .- >me const: acts used for expression of the 
polypeptides. Furthermore, any highly hydrophobic a anno acid sequences occurring 
at the carboxy terminus have also been exceeded fro.r. :oe recombinant expression 
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constructs. Tie.- . in one embodiment, a po: /peptide v. inch represents ;i iruncated or 
modified ORF a., ay be used as an antigen. 

While numerous \: ethods are known in the an \ev selecting potentially 
immunogenic ; -peptide a m any of the Oh Fs disci : e . herein were selected on the 
5 basis of screening Entcroi occus faecalis ORFs for several aspects of potential 
immunogeniciiY One set of selection criteri a are as foii.-nvs: 

1. Typn : signed se ntence: An anun. termin. ype 1 signal sequence generally 
directs a nascent protein a ■,. ross the plasma raid outer membranes to the exterior of the 
bacterial cell. Anpcrimental evidence obtain, d from studies with Escherichia coli 

10 suggests that the typical p\pc i signal sequenee consist- -of the following biochemical 
and physical attributes (Izard, J. W. and Kendall, D. A MoL Microbiol. 7J:765-773 
(1994)). The lei:gth of the type I signal sequence is approximately 15 to 25 primarily 
hydrophobic ammo acid residues with a net positive charge in the extreme amino 
terminus. In aeemon. the .entral region ol'ii.e signal sentience adopts an alpha-helical 

15 conformation in a hydrophobic environment Finally, tae region surrounding the 

actual site of cleavage is ideally six residues :ong, with small side-chain amino acids in 
the -1 and -3 positions. 

2. Type / V signed sequence: The type IV signal sequence is an example of the 
several types of functional signal sequences which exist in addition to the type I signal 

20 sequence detailed above. Although functionally related, the type IV signal sequence 
possesses a un...ue set of biochemical and physical attributes (Strom, M S. and Lory, 
S., J. Bacterial ' ~^A345 A^5 1 (1992)). These arc typically six to eight amino acids 
with a net basin charge followed by an addidonal sixteen to thirty primarily 
hydrophobic residues. The cleavage site of a type IV signal sequence is typically after 

25 the initial six to eight ammo acids at the extreme amine - terminus. In addition, type IV 
signal sequences generally contain a phenylalanine residue at the +1 site relative to the 
cleavage site. 

3. Lij)ceroicin: Studies of the cleavage sites oA twenty-six bacterial 
lipoprotein precursors has allowed the definition of a c nsensus amino acid sequence 
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for lipepro:::: . ;..\:.o.. '-;;i:lv three- . 
examined com...:. ed the . ....cnce I .-« /-..> - 
the point o; el . ..ce 1 II. ami. S. ami V, _ 
(1990 V). 

- 4. /./- ■ ■/■.- ; ■ been eve/ 

proteins foun, :-, me v.- :U v of gram- pes; ' 
carboxy term:/.... >eemeo. /. More than :e 
pyogenes, S. /<..//;;<. /: , ,//;.v, 5. pnehu:. 
on their extrac -Ih.iar lec/.oon and carbo>;v : 

in A., ASA/ Abe ;.40r-:: i 1 996V). The 
ammo acids a: : ,e extreme .arboxv termim 
acids presume. :e fimctn a as a transmemb: 
transmembrane eemam n a six ammo acid 
examined. The ;;;nmo /.../! sequence of t!m 

15 amino acid. 

An ale oihm for ; /lee ting antigenic 
polypeptides ; lading el./ foregoing cruem 
to that describe., m L.S aieni application, 
is fully incorpo: ened by reference herein. I 

20 select immune ogicnlly useful Enicroc me ; 
selection of a number of me disclosed ORf 
polypeptides identified m this group may 
art and as furtVei described herein. 



e ol b e/erial lipeprot. m precursors 
i.A )-C sitions -3 to -r ; . relative to 
f C..J. .nerg. BioiKcmr-. ^2:451-471 

nentally e-ermmed that m est anchored 
ve bactee . possess a highh conserved 
such pr e as from organisms such as 5. 
:ae % anc; ers, have been identified based 
/minal a; o acid sequence (Fischetti, V. 
seived : ee n consists of six charged 
couple,: .o 1 5-20 hydrophobic amino 
me dom. e Immediately adjacent to the 
equence / - /.served in nearly rdl proteins 
region i> P-X-T-G-X. where X is any 

ad immo e genie Enterococ. us faecalis 
was de- e ned. The algorithm is similar 
■S/78lA v e filed Januar>' 3. 1 997, which 
sc of the eiuorithrn by the inventors to 
faecalis polypeptides resulted in the 
■e Polypemides comprising the 
/ produce/ :w techniques standard in the 



25 Nucleic Acid Molecules 

Sequenced E. ,,caiis genomic DNA was r taincd iV- . the E. faecalis strainV586. As 
discussed elscu. acre heiv/m. polynucleotides of the • -ent invention readily may be 
obtained by roumic application of well known and see: lard procedures for cloning 
and sequencing I D N A . beamed methods io: obtamme libraries and for sequencing are 
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provided beK .-. . for msemcc. A wide variety of Emc < -coccus faecalis strains that can 
be used to prep;: :"e E. fee- aiis genomic DN'A for clonine and for obtaining 
polynucleotides and per. peptides of the p: esent invention. A wide variety of 
Enterococcus ■'accaiis strains arc available i the pur's from recognized depository 

5 institutions, st.ch as the American Type (C.ture Coiiceiion (ATCC). It is recognized 
that minor va; ration is the nucleic acid ami amino ace: equence may be expected from 
E faecalis strain to strain The present inv ention provides for genes, including both 
polynucleotides and polypeptides, of the oi the present invention from all the 
Enterococcus i accaiis strains. 

10 Unless otherwise indicated, all nucleotide sequences determined by sequencing 

a DNA molecule herein were determined :r ng an am. mated DNA sequencer (such as 
the Model 37/ from Applied Biosystcms. inc., Foster City, CA), and all amino acid 
sequences of pie y peptides encoded by DN.e molecules determined herein were 
predicted by translation of a DNA sequence determined as above. Therefore, as is 

15 known in the art for any ON A sequence determined by this automated approach, any 
nucleotide sequence determined herein mar contain sonic errors. Nucleotide 
sequences determined by automation are typically at eeast about 90% identical, more 
typically at least about 9e% to at least abort 99.9% identical to the actual nucleotide 
sequence of the sequenced DNA molcenie. The actual sequence can be more 

20 precisely determined by ether approaches including manual DNA sequencing methods 
well known in I he art. A* is also knowr. m the art, a single insertion or deletion in a 
determined nucleotide seetuence compared t 1 the actu.u sequence will cause a frame 
shift in translation of the nuelcotide sequence such that the predicted amino acid 
sequence encoded by a determined nucleotide sequcnee will be completely different 

25 from the amino acid sequence actually encoded by the sequenced DNA molecule, 
beginning at tne point of such an insertion cr deletion. In case of conflict between 
Table 1 and cover the nucleic acid sequence of the cienes listed in Tabic 1 or the amino 
acid sequence of the protein expressed bv ti c clones listed in Table 1, the clones listed 
in Table 1 are eontrollinc. Hy "nucleotide sequence' of a nucleic acid molecule or 
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In adeun u. isouacd nucleic acid .cules of a,e invention include DNA 
molecules wived eo; up; e • :i sequence subs; enhally different from those described 
above but whvd. due io : : .c degeneracy off c genetic evde, still encode a E. faecalis 
polypeptides and peptide od the present a ention re u. polypeptides of Table 1). 

5 That is, all possible DNA sequences L h a : ce.eode the 'V faecalis polypeptides of the 
present invention. Tin:; includes the geivetn code and wecies-spccific codon 

preferences k: :o a u in the ad. Thus, u \w. be iv for one skilled in the art to 

generate the degenerate variants described a ove. ibr instance, to optimize codon 
expression for a particular host (e.g.. change codons in the bacteria m.RNA to those 

10 preferred by a mammalian or other bacterial nost such, as E. coli). 

The invention hirdici" provides isokv.ed nucleic a :id molecules having the 
nucleotide sequence shown \y. Table 1 or a nucleic acid; molecule having a sequence 
complementary to one o! die above sequences. Sued, : .dated molecules, particularly 
DNA molecules, are use: ..: ^ ; probes do; eeae mapping and for identifying E, faecalis 

15 in a biological sample, for instance, by PCf: Southern ' lot, Northern blot, or other 
form of hybndization analysis. 

The present m\ cudov is further adivvaed to nucleic acid molecules encoding 
portions or fragments of die nucleotide seen, aces described herein. Fragments include 
portions of the nucleotide sequences of l ab a 1, or the E. faecalis nucleotide 

20 sequences contained u. the p.asimd clones a led in da a 1, at least 10 contiguous 
nucleotides in length seleate,. from any two ntegcrs. one of which representing a 5' 
nucleotide position and a second of which representing a 3' nucleotide position, where 
the first nucleotide lor each nucleotide sequence in Tabic 1 is position 1 . That is, 
every combin.iiion of a a' an i 3' nucleotide : osition nav. a fragment at least 10 

25 contiguous nucleotides a wagth could oeea v is incdeaed in the invention. At least 
means a fragment may be i 0 contiguous nuc.eotide da: as in length or any integer 
between 10 and due lenede. o! an entire nueae elide sequence of Table 1 minus 1. 
Therefore, included in the invention are coavguous o; enients specified by any 5' and 
3' nucleotide vase positions of a nucleotide . cquences of Table 1 wherein the 
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conditions" : lmnuied ■ ermght inctma. - at A2 v a ;i solution comprising: 50% 
formamide. :m : ; SC ( 1 :"■ • ' 1 And, 1 : a A tnsodium .urate), 50 mM sodium 
phosphate (y 1 •>;. As 1 . ; ;nardt's so dd :■ a ' 0% dc.\:::m sulfate, and 20 |jg/ml 
denatured, shea: mi sain.'. :, s mrm DN A. m. mved lm • hing the filters in O.lx SSC at 
5 about 65°C. 

By a poivnuclet em c which hvbn dices to a "pinion" of a polynucleotide is 
intended a polynucleotide t mther 13 N A t : \ T A) hvh; mazing to at least about 1 5 
nucleotides hascv and mere preferably ad ic ; ;t abou; i o nucleotides bases, still more 
preferably a i leas; about A: nucleotides mn.^. and even more preferably about 30-70 

10 (e.g., 50) nucueoudes basc^T the reference olynumc. dde. These arc useful as 

diagnostic pr b:: and ;• r. mss as discusser .hove. My a portion of a polynucleotide 
of "at least 20 maclcotiue-' A ses in Icrmd : r example, is intended 20 or more 
contiguous nucleotides base; nucleotides in an the nucleotide sequence of the reference 
polynucleotide [e.g.. the nucleotide sequcm a as show ; :n Table ] ). Portions of a 

15 polynucleotide " inch h\ aaizes to a reus' ; de sequence in Table 1. which can be 
used as probes a ad prime:-, may also be precisely specified by 5' and 3' base 
positions or by size in nucleotide bases as described amove or precisely excluded in the 
same manner 

The nucleic acid mmecules of tier present in veuuon include those encoding the 
20 full length E. faecal;;; polypeptides of Table 1 and portions of the E.faecalis 

polypeptides of fable 1 Also included n: tm present invention are nucleic acids 
encoding the nbc-vc full ionuih sequence-; an . further comprise additional sequences, 
such as those er.eodmg aa raided secretory acier sequence, such as a pre-, or pro- or 
prepro- proteai sequence Further included m the present invention are nucleic acids 
25 encoding the above full numbi sequences rem portions 'hereof and further comprise 
additional heterologous ammo acid sequence-; encoded ' v nucleic acid sequences from 
a different source. 

Also \ a ended m :ec present invent;. ■ ; arc nuAar:; acids encoding the above 
protein sequences together with additional. ; n-cocid. e >equcnces, including for 
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See. e ,e . 8. Lewim Genes IV (1990). 
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more nuclecw c 'i he ... aaas nw ewe w: .. in cow c regions, non-coding regions, 
or both. A!a en . -. : ■ s i . wimp region- •' proa : -anservative or 
non-conserva e, a amine ; c:e. substitution. . :!cuons ; r .additions. Especially 
preferred an; a a awsc ,cc sacra substmea .■ . addm • • and deletions, which do not 
5 alter the proper; w. ami .-;:■. ' we of a a ;iis proaw; of the present invention or 
portions theme . Also c-'wwady preierw-f this re. : : are conservative 
substitutions 

Such wncaeplwc caaants mcluar a. sc proee.. cad by amino acid 
substitutions, dc. aliens a .wamoas. The : wiinrn: : . deletions, or additions may 

10 involve one or mere reswear Alterations mey produce conservative or 

non-conserva awe amine aea: subsiiumo: aa ecictions. w additions. Especially 
preferred ami :u :.ee.w a; wscm subsumae.. . add a. - and deletions, which do not 
alter the propeiT.es and . ww ma- aid: a e.s .« lis pro: e a: of the present invention or 
portions theree.. Also especially p:cie : ca ... ilus rcawa: are conservative 

15 substitutions 

Thep e-eai no . we. e aa a reia;e- a wcombu.wu vectors, which include the 
isolated nucle.e . : cid mo-comes of die pe e., invention, and to host cells containing 
the recombinant \vc:oi •. as -e/cll as to nwawws of making such vectors and host cells 
and for using them i or w eaacuoa of E. a;. .;iis pop. -peptides or peptides by 
20 recombinant tecniuques 

The preseni appaenmm is dirccteu a nucleic ,;aa molecules at least 90%, 
95%, 96%, 9 " °fs% or <n 'y- ; identical te a aacleic aea: sequence shown in Table 1. 
The above mwow aeici scans: ices are uumwea irrespective of whether they encode a 
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mRNA expression samples, environmental samples, suspected of containing E. 
faecalis by Northern Blot ar s. 

Preferred, are nucleic acid molecules having sequences at least 90%, 95%, 96%, 
97%, 98% or 99% identical to the nucleic acid sequence shown in Table 1, which do, 
5 in fact, encode a polypeptide having E. faecalis protein activity By "a polypeptide 
having E. faecalis activity" is intended polypeptide i xhibiting activity similar, but 
not necessarily identical, to an activity of the E.faet irfis protein of the invention, as 
measured in a particular biol . 1 assay suitable fui measuring activity of the 
specified protein. 

10 Due to the degeneracy of the genetic code, one of ordinary skill in the art will 

immediately recognize that a large number of the nucleic acid molecules having a 
sequence at least 90%, 95%, 9(5%, 97%, 98%, or 99% identical to the nucleic acid 
sequences shown in Table 1 will encode a polypeptide having E. faecalis protein 
activity. In fact, since degenerate variants of these nucleotide sequences all encode the 

15 same polypeptide, this will I ar to the skilled artisan even without performing the 
above described comparison assay. It will be further recognized in the art that, for 
such nucleic acid molecules that arc not degenerate variants, a reasonable number will 
also encode a polypeptide having E. faecalis protein activity. This is because the 
skilled artisan is fully aware of amino acid substitu i is that are either less likely or 

20 not likely to significantly effect protein function (e.g., replacing one aliphatic amino 
acid with a second aliphatic \ » acid), as further described below. 

The biological activity - function of the polypeptides of the present 
invention are expected to be similar or identical to polypeptides from other bacteria 
that share a high degree of structural identity/similarity. Tables 2 lists accession 

25 numbers and descriptions for the closest matching sequences of polypeptides 

available through Genbank and Derwent databases. It is therefore expected that the 
biological activity or function of the polypeptides oi the present invention will be 
similar or identical to those ] < eptides from other bacterial genusea, species, or 
strains listed in Table 2. 
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By a polynucleotide 1 ■ , 12 a nucleotide sequence at least, for example. 95% 
"identical" to a reference nucleotide sequence of the present invention, it is intended 
that the nucleotide sequence of the polynucleotide is identical to the reference 
sequence except that the polynucleotide sequence may include up to five point 
5 mutations per each 100 nucleotides of the reference - -:ieotide sequence encoding the 
E.faecalis polypeptide. In other words, to obtain a polynucleotide having a 
nucleotide sequence at least , dentical to a reference nucleotide sequence, up to 
5% of the nucleotides in the id*: ence sequence may be deleted, inserted, or 
substituted with another nucleotide. The query sequence may be an entire sequence 
10 shown in Table 1 . the ORF (open reading frame), or any fragment specified as 
described herein. 

As a practical matter, "-^ther any particular nucleic acid mol ;ule or 
polypeptide is at least 90%, 96%, 97%, 98% or 99% identical to a nucleotide 
sequence of the presence invu-i, n can be determined conventionally using known 

15 computer programs. A preferred method for determining the best overall match 
between a query sequence (a sequence of the present invention) and a subject 
sequence, also referred to as a global sequence alignment, can be determined using the 
FASTDB computer program based on the algorithm vBrutlag et al. See Brutlag et 
al. (1990) Comp. App. Biosci. 6:237-245. In a sequence alignment the query and 

20 subject sequences are both Di ,\ cquences. An RNA sequence can be compared by 
first converting U's to T's. The csult of said global sequence alignment is in percent 
identity. Preferred parameters used in a FASTDB alignment of DNA sequences to 
calculate percent identity are: Matrix=Unitary, k-tuple=4, Mismatch Penalty-!, 
Joining Pcnalty-30, Randomisation Group Length )■ Cutoff Score=l, Gap 

25 Penal ty=5 } Gap Size Penalty 0 Window Size=5ft0 or the lenght c; J the subject 
nucleotide sequence, which evu \* shorter. 

If the subject sequence i- shorter than the query sequence because of 5' or 3' 
deletions, not because of internal deletions, a manual correction must be made to the 
results. This is because the FASTDB program does not account for 5' and 3' 
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truncations of the subject sequence when calculating percent identity. For subject 
sequences truncated at the 5' or 3 1 ends, relative to the query sequence, the percent 
identity is corrected by calculating the number of I ;s of the query sequence that are 
5* and 3' of the subject sequence, which are not matched/aligned, as •* percent of the 
5 total bases of the query sequoin i , Whether a nucleotide is matched/aligned is 

determined by results of the i ' \ iTDB sequence alignment. This percentage is then 
subtracted from the percent identity, calculated by the above FASTDB program using 
the specified parameters, to arrive at a final percent identity score. This corrected 
score is what is used for the purposes of the present invention. Only nucleotides 
10 outside the 5' and 3' nucleotides of the subject sequence, as displayed by the 

FASTDB alignment, which are not matched/aligned with the query sequence, are 
calculated for the purposes 1 1 in nually adjusting the percent identity score. 

For example, a 90 nucleotide subject sequence is aligned to a 1 00 nucleotide 
query 7 sequence to determine percent identity. The deletions occur at the 5' end of the 
15 subject sequence and therefore, the FASTDB alignment does not show a 

matched/alignment of the first 1 0 nucleotides at 5' / v. The 10 unpaired nucleotides 
represent 10% of the sequent (number of nucleotides at the 5' and 0 ' ends not 
matched/total number of nucleotides in the query sequence) so 10% is subtracted from 
the percent identity score calculi ted by the FASTDB program. If the remaining 90 
20 nucleotides were perfectly matched the final percent identity would be 90%. In 

another example, a 90 nucleotide subject sequence is compared with a 100 nucleotide 
query sequence. This time the deletions are internal deletions so that there are no 
nucleotides on the 5' or 3* of the subject sequence Viich are not matched/aligned with 
the query. In this case the percent identity calculated by FASTDB is not manually 
25 corrected. Once again, only nt i otides 5' and 3' of the subject sequence which are 
not matched/aligned with the query sequence are manually corrected for. No other 
manual corrections are to made for the purposes of the present invention. 



Vectors and Host Cell 
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The present invention also relates to vectors which include the isolated DNA 
molecules of the present invention, host cells eompi n ing the recombinant vectors, and 
the production of E. faccali > -Mypeptides and peptides of the presi nt invention 
expressed by the host cells. 
5 Recombinant constructs may be introduced into host cells using well known 

techniques such as infection, transduction, transaction, transvection, electroporation 
and transformation. The vector may be, for example, a phage, plasmid, viral or 
retroviral vector. Retroviral vectors may be replies i.»n competent or replication 
defective. In the latter case, , I propagation generally will occur only in 

1 0 complementing host cells. 

The polynucleotides ni.r. be joined to a vector containing a selectable marker 
for propagation in a host. Generally, a plasmid vector is introduced in a precipitate, 
such as a calcium phosphate precipitate, or in a complex with a charged lipid. If the 
vector is a virus, it may be packaged in vitro using an appropriate packaging cell line 

15 and then transduced into host cells. 

Preferred are vectors reprising m-acting control regions to the 
polynucleotide of interest. Appropriate /wz.v-acting factors may be supplied by the 
host, supplied by a complementing vector or supplied by the vector itself upon 
introduction into the host. 

20 ln certain preferred embodiments in this regard, the vectors provide for 

specific expression, which may be inducible and/oj , dl type-specific. Particularly 
preferred among such vecto? i those inducible by environmental luctors that are 
easy to manipulate, such as temperature and nutrient additives. 

Expression vectors uselul in the present invention include chromosomal-, 

25 episomal- and virus-derived veclors, e.g., vectors derived from bacterial plasmids, 
bacteriophage, yeast episomes, yeast chromosomal elements, viruses such as 
baculoviruses, papova viruses, vaccinia viruses, adenoviruses, fowl pox viruses, 
pseudorabies viruses and retroviruses, and vectors J. lived from combinations thereof, 
such as cosmids and phagemirlf.' 
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The DNA insert shoi III* operativcly linked to an appropriate promoter, 
such as the phage lambda PL promoter, the E. coli lac, trp and tac promoters, the 
SV40 early and late promoters and promoters of retroviral LTRs, to name a few. 
Other suitable promoters will be known to the skilled artisan. The expression 
5 constructs will further contain sites for transcription initiation, termination and, in the 
transcribed region, a ribosome binding site for translation. The coding portion of the 
mature transcripts expressed by the constructs will preferably include a translation 
initiating site at the beginning n la termination codon (UAA, UGA or UAG) 
appropriately positioned at the end of the polypeptide to be translated. 
10 As indicated, the expression vectors will preferably include at least one 

selectable marker. Such markers include dihydrofolate reductase or neomycin 
resistance for eukaryotic cell culture and tetracycline kanamycin, or ampicillin 
resistance genes for culturing in E. coli and other bacteria. Representative examples of 
appropriate hosts include, bui nit not limited to, bacterial cells, such as E. coli, 
15 Streptomyces and Salmonella tvphimuriwn cells; fungal cells, such as yeast cells; insect 
cells such as Drosophila S2 and Spodoptera Sf9 cells; animal cells such as CHO, COS 
and Bowes melanoma cells; and plant cells. Appropriate culture mediums and 
conditions for the above-described host cells are known in the art. 

Among vectors preferred for use in bacteria include pQE70, pQE60 and pQE9, 
20 pQElO available from Qiagen; pBS vectors, Phagescript vectors, Bluescript vectors, 
pNH8A, pNH16a, pNH18A, | NI146A available from Stratagene; pET series of 
vectors available from Novagcn; and ptrc99a, pKK223-3, pKK233-3, pDR540, 
pRIT5 available from Pharmacia. Among preferred eukaryotic vectors arepWLNEO, 
pSV2CAT, pOG44, pXTl and pSG available from Stratagene; andpSVK3, pBPV, 
25 pMSG and pSVL available from Pharmacia. Other mi li table vectors will be readily 
apparent to the skilled artisan. 

Among known bactei hi I j romoters suitable for use in the present invention 
include the £. coli lac\ and Utc7, promoters, the T3, T5 and T7 promoters, the gpt 
promoter, the lambda PR and PL promoters and the trp promoter. Suitable eukaryotic 
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promoters include the CMV immediate early promoter, the HSV thymidine kinase 
promoter, the early and late SY40 promoters, the promoters of retroviral LTRs. such 
as those of the Rous sarcoma nrus (RSY). and mctallothionein promoters, such as the 
mouse metallothionein-I promoter. 
- Introduction of the construct into the host cell can be effected by calcium 

phosphate transection, DEAE-dextran mediated transfection, cationic lipid-mediated 
transfection, electroporation. transduction, infection or other methods. Such methods 
are described in many standard laboratory manuals ffor example, Davis, el al } Basic 
Methods In Molecular Biology (1 986)). 

10 Transcription ofDN.A encoding the polypeptides of the present invention by 

higher eukaryotes may be increased by inserting an enhancer sequence into the vector. 
Enhancers are cw-acting elements of DNA, usually about from 10 to 300 nucleotides 
that act to increase transcriptional activity of a promoter in a given host cell-type. 
Examples of enhancers include the SV40 enhancer, which is located on the late side of 

15 the replication origin at nucleotides 100 to 270, the cytomegalovirus early promoter 
enhancer, the polyoma enhancer on the late side of the replication origin, and 
adenovirus enhancers. 

For secretion of the translated polypeptide into the lumen of the endoplasmic 
reticulum, into the periplasmic space or into the extracellular environment, 

20 appropriate secretion signals may be incorporated into the expressed polypeptide, for 
example, the amino acid sequence KDEL. The signals may be endogenous to the 
polypeptide or they may be heterologous signals. 

The polypeptide may he expressed in a modified form, such as a fusion 
protein, and may include not only secretion signals, but also additional heterologous 

25 functional regions. For instance, a region of additional amino acids, particularly 

charged ammo acids, may be added to the N-terminus of the polypeptide to improve 
stability and persistence in the host cell, during purification, or during subsequent 
handling and storage. Also, peptide moieties may be added to the polypeptide to 
facilitate purification. Such regions may be removed prior to final preparation of the 
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polypeptide. The addition of peptide moieties to polypeptides to engender secretion 
or excretion, to improve stability and to facilitate purification, among others, are 
familiar and routine techniques in the art. A preferred fusion protein comprises a 
heterologous region from immunoglobulin thai is useful to solubilize proteins. For 

5 example, EP-A-0 464 533 (Canadian counterpart 2045869) discloses fusion proteins 
comprising various portions of constant region of immunoglobulin molecules together 
with another human protein or part thereof. In many cases, the Fc part in a fusion 
protein is thoroughly advantageous for use in therapy and diagnosis and thus results, 
for example, in improved pharmacokinetic properties (EP-A 0232 262). On the other 

10 hand, for some uses it would be desirable to be able to delete the Fc part after the 

fusion protein has been expressed, detected and purified in the advantageous manner 
described. This is the case when Fc portion proves to be a hindrance to use in 
therapy and diagnosis, for example when the fusion protein is to be used as antigen for 
immunizations. In drug discovery, for example, human proteins, such as, 

15 hlL5 -receptor has been fused with Fc portions for the purpose of high-throughput 
screening assays to identify antagonists of hIL-5. See Bennett, D. et al. (1995) J. 
Molec. Recogn. 8:52-58 and Johanson, K. et al. (1995) J. Biol. Chera. 270 
(16):9459-9471. 

The E.faecalis polypeptides can be recovered and purified from recombinant 
20 cell cultures by well-known methods including ammonium sulfate or ethanol 
precipitation, acid extraction, anion or cation exchange chromatography, 
phosphocellulose chromatography, hydrophobic interaction chromatography, affinity 
chromatography, hydroxylapatite chromatography, lectin chromatography and high 
performance liquid chromatography ("HPLC") is employed for purification. 
25 Polypeptides of the present invention include naturally purified products, products of 
chemical synthetic procedures, and products produced by recombinant techniques 
from a prokaryotic or eukaryotic host, including, for example, bacterial, yeast, higher 
plant, insect and mammalian cells. 
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Polypeptides and Fragments 

The invention further provides an isolated E.jaecalis polypeptide having an 
ammo acid sequence in Table ]. or a peptide or polypeptide comprising a portion of 
the above polypeptides. 

Variant and Mutant Polypeptides 

To improve or alter the characteristics of E.jaecalis polypeptides of the 
present invention, protein engineering may be employed, Recombinant DNA 
technology known to those skilled in the art can be used to create novel mutant 
proteins or mutems including single or multiple amino acid substitutions, deletions, 
additions, or fusion proteins. Such modified polypeptides can show, e.g., enhanced 
activity or increased stability. In addition, they may be purified in higher yields and 
show better solubility than the corresponding natural polypeptide, at least under 
certain purification and storage conditions. 



N-Terminal and C-Terminal Deletion Mutants 

It is known in the art that one or more amino acids may be deleted from the 
N-terminus or C-tcrniinus without substantial loss of biological function. For 
instance, Ron et al. J. Biol. Chem., 268:2984-2988 (1993), reported modified KGF 
20 proteins that had heparin binding activity even if 3, 8, or 27 N-tenmnal amino acid 
residues were missing. Accordingly, the present invention provides polypeptides 
having one or more residues deleted from the amino terminus of the amino acid 
sequence of the E.faecalis polypeptides shown in Table 1, and polynucleotides 
encoding such polypeptides. 

Similarly, many examples of biologically functional C-terminal deletion 
mutems are known. For instance, Interferon gamma shows up to ten times higher 
activities by deleting 8-10 amino acid residues from the carboxy terminus of the 
protein See, e.g., Dobeli, et al. (1988) J. Biotechnology 7:199-216. Accordingly, the 
present invention provides polypeptides having one or more residues from the 
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carboxy terminus of the amino acid sequence of the E. faecalis polypeptides shown in 
Table 1. The invention also provides polypeptides having one or more amino acids 
deleted from both the amino and the carboxyl termini as described below. 

The present invention is further directed to polynucleotide encoding portions 

5 or fragments of the amino acid sequences described herein as well as to portions or 
fragments of the isolated amino acid sequences described herein. Fragments include 
portions of the amino acid sequences of Table 1, arc at least 5 contiguous amino acid 
in length, are selected from any two integers, one of which representing a N-terminal 
position. The initiation codon of the polypeptides of the present inventions position 

10 1 . Every combination of a N-terminal and C-terminal position that a fragment at least 
5 contiguous amino acid residues in length could occupy, on any given amino acid 
sequence of Table 1 is included in the invention. At least means a fragment may be 5 
contiguous amino acid residues in length or any integer between 5 and the number of 
residues in a full length amino acid sequence minus 1. Therefore, included in the 

15 invention are contiguous fragments specified by any N-terminal and C-terminal 

positions of amino acid sequence set forth in Table 1 wherein the contiguous fragment 
is any integer between 5 and the number of residues in a full length sequence minus 1 . 

Further, the invention includes polypeptides comprising fragments specified 
by size, in amino acid residues, rather than by N-terminal and C-tcrminal positions. 

20 The invention includes any fragment size, in contiguous amino acid residues, selected 
from integers between 5 and the number of residues in a full length sequence minus 1 . 
Preferred sizes of contiguous polypeptide fragments include about 5 amino acid 
residues, about 10 amino acid residues, about 20 amino acid residues, about 30 amino 
acid residues, about 40 amino acid residues, about 50 amino acid residues, about 100 

25 amino acid residues, about 200 amino acid residues, about 300 amino acid residues, 
and about 400 amino acid residues. The preferred sizes are, of course, meant to 
exemplify, not limit, the present invention as all size fragments representing any 
integer between 5 and the number of residues in a full length sequence minus 1 are 
included in the invention. The present invention also provides for the exclusion of any 
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fragments specified by N-termmal and C-terminal positions or by size in amino acid 
residues as described above. Any number of fragments specified by N-termmal and 
C-terminal positions or by size in ammo acid residues as described above may be 
excluded. 

5 The above fragments need not be active since they would be useful, for 

example, in immunoassays, in epitope mapping, epitope tagging, to generate 
antibodies to a particular portion of the protein, as vaccines, and as molecular weight 
markers. 

10 Other Mutants 

In addition to N- and C-terminal deletion forms of the protein discussed above, 
it also will be recognized by one of ordinary skill in the art that some amino acid 
sequences of the E. faecalis polypeptide can be varied without significant effect of the 
structure or function of the protein. If such differences in sequence are contemplated, 

! 5 it should be remembered that there will be critical areas on the protein which 
determine activity'. 

Thus, the invention further includes variations of the E. faecalis polypeptides 
which show substantial E. faecalis polypeptide activity or which include regions off. 
faecalis protein such as the protein portions discussed below. Such mutants include 

20 deletions, insertions, inversions, repeats, and type substitutions selected according to 
general rules known in the art so as to have little effect on activity. For example, 
guidance concerning how to make phenotypically silent amino acid substitutions is 
provided. There are two main approaches for studying the tolerance of an amino acid 
sequence to change. See, Bowie, J. U. et al (1990), Science 247:1306-1310. The first 

25 method relies on the process of evolution, in which mutations are either accepted or 
rejected by natural selection. The second approach uses genetic engineenng to 
introduce ammo acid changes at specific positions of a cloned gene and selections or 
screens to identify sequences that maintain functionality. 

These studies have revealed that proteins are surprisingly tolerant of amino 
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acid substitutions. The studies indicate which amino acid changes are likely to be 
permissive at a certain position of the protein. For example, most buried amino acid 
residues require nonpolar side chains, whereas few features of surface side chains are 
generally conserved. Other such phenotypically silent substitutions are described by 

5 Bowie et al. (supra) and the references cited therein. Typically seen as conservative 
substitutions are the replacements, one for another, among the aliphatic amino acids 
Ala, Val, Leu and lie; interchange of the hydroxy! residues Ser and Thr, exchange of 
the acidic residues Asp and Glu, substitution between the amide residues Asn and 
Gin, exchange of the basic residues Lys and Arg and replacements among the aromatic 

10 residues Phe, Tyr. 

Thus, the fragment, derivative, analog, or homolog of the polypeptide of Table 
1 , or that encoded by the plaimds listed in Table 1 , may be: (i) one in which one or 
more of the amino acid residues are substituted with a conserved or non-conserved 
amino acid residue (preferably a conserved amino acid residue) and such substituted 

15 amino acid residue may or may not be one encoded by the genetic code; or (ii) one in 
which one or more of the amino acid residues includes a substituent group: or (hi) one 
in which the E.faecalis polypeptide is fused with another compound, such as a 
compound to increase the half-life of the polypeptide (for example, polyethylene 
glycol): or (iv) one in which the additional amino acids are fused to the above form of 

20 the polypeptide, such as an IgG Fc fusion region peptide or leader or secretory 

sequence or a sequence which is employed for purification of the above form of the 
polypeptide or a proprotein sequence. Such fragments, derivatives and analogs are 
deemed to be within the scope of those skilled in the art from the teachings herein. 

Thus, the E. faecalis polypeptides of the present invention may include one or 

25 more amino acid substitutions, deletions, or additions, either from natural mutations or 
human manipulation. As indicated, changes are preferably of a minor nature, such as 
conservative amino acid substitutions that do not significantly affect the folding or 
activity of the protein (see Table 3). 

Amino acids in the E.faecalis proteins of the present invention that are 
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essential for function can be identified by methods known in the art, such as site- 
directed mutagenesis or alanine-scanning mutagenesis. See, e.g., Cunningham et ah 
(1989) Science 244: 1 OS 1 - 1 085 . The latter procedure introduces single alanine 
mutations at ever/ residue m the molecule. The resulting mutant molecules are then 
5 tested for biological activity using assays appropriate for measuring the function of 
the particular protein. 

Of special interest arc substitutions of charged amino acids with other charged 
or neutral ammo acids which may produce proteins with highly desirable improved 
characteristics, such as less aggregation. Aggregation may not only reduce activity but 

10 also be problematic when preparing pharmaceutical formulations, because aggregates 
can be immunogenic. See, e.g., Pinckard et aL (1967) Clin. Exp. Immunol. 2:331-340; 
Robbins, et aL (1987) Diabetes 36:838-845; Cleland, et al., (1993) Crit. Rev. 
Therapeutic Drug Carrier Systems 10:307-377. 

The polypeptides of the present invention are preferably provided in an 

15 isolated form, and preferably are substantially purified. A recombinant^ produced 
version of the E. faecalis polypeptide can be substantially purified by the one-step 
method described by Smith et ah (1988) Gene 67:31-40. Polypeptides of the 
invention also can be purified from natural or recombinant sources using antibodies 
directed against the polypeptides of the invention in methods which are well known in 

20 the art of protein purification. 

The invention further provides for isolated E. faecalis polypeptides 
comprising an amino acid sequence selected from the group consisting of: (a) the 
ammo acid sequence of a fulMength E. faecalis polypeptide having the complete 
amino acid sequence shown in Table 1 ; (b) the ammo acid sequence of a full-length E. 

25 faecalis polypeptide having the complete amino acid sequence shown in Table 1 

excepting the N-terminal methionine; (c) the complete ammo acid sequence encoded 
by the plaimds listed in Table 1 ; and (d) the complete amino acid sequence excepting 
the N-terminal methionine encoded by the plaimds listed in Table 1 . The 
polypeptides of the present invention also include polypeptides having an amino acid 
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sequence at least 80% identical, more preferably at least 90% identical, and still more 
preferably 95%, 96%, 97%, 98% or 99% identical to those described in (a), (b), (c), 
and (d) above. 

Further polypeptides of the present invention include polypeptides which 
5 have at least 90% similarity, more preferably at least 95% similarity, and still more 
preferably at least 96%, 97%, 98% or 99% similarity to those described above. 

A further embodiment of the invention relates to a polypeptide which 
comprises the amino acid sequence of a E. faecalis polypeptide having an amino acid 
sequence which contains at least one conservative ammo acid substitution, but not 

in more than 50 conservative amino acid substitutions, not more than 40 conservative 
amino acid substitutions, not more than 30 conservative ammo acid substitutions, and 
not more than 20 conservative amino acid substitutions. Also provided are 
polypeptides which comprise the amino acid sequence of a E. faecalis polypeptide, 
having at least one, but not more than 10, 9, 8, 7, 6, 5, 4, 3, 2 or 1 conservative amino 

15 acid substitutions. 

By a polypeptide having an amino acid sequence at least, for example, 95% 
"identical" to a query amino acid sequence of the present invention, it is intended that 
the amino acid sequence of the subject polypeptide is identical to the query sequence 
except that the subject polypeptide sequence may include up to five amino acid 

20 alterations per each 100 amino acids of the query amino acid sequence. In other 

words, to obtain a polypeptide having an amino acid sequence at least 95% identical 
to a query amino acid sequence, up to 5% of the amino acid residues in the subject 
sequence may be inserted, deleted, (indels) or substituted with another amino acid. 
These alterations of the reference sequence may occur at the amino or carboxy 

25 terminal positions of the reference amino acid sequence or anywhere between those 
terminal positions, interspersed either individually among residues in the reference 
sequence or in one or more contiguous groups within the reference sequence. 

As a practical matter, whether any particular polypeptide is at least 90%, 
95%, 96%, 97%, 98% or 99% identical to, for instance, the amino acid sequences 
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shown in Table 1 or to the ammo acid sequence encoded by the pkumds listed in Table 
1 can be determined conventionally using known computer programs. A preferred 
method tor determining the best overall match between a query sequence i a sequence 
of the present invention) and a subject sequence, also referred to as a global sequence 
alignment, can be determined using the FASTDB computer program based on the 
algorithm of Brutlag et ah. (1990) Comp. App. Biosci. 6.237-245. In a sequence 
alignment the query and subject sequences are both ammo acid sequences. The result 
of said global sequence alignment is in percent identity. Preferred parameters used in a 
FASTDB amino acid alignment arc: Matrix=PAM 0, k-tuple=2. Mismatch 
Penalty=l. Joining Penalty-20, Randomization Group Length^O, Cutoff Scorc=l, 
Window Size-sequence length, Gap Penalty=5, Gap Size Penalty=0.05, Window 
Size-500 or the length of the subject ammo acid sequence, whichever is shorter. 

If the subject sequence is shorter than the query sequence due to N- or C- 
termmal deletions, not because of internal deletions, the results, in percent identity, 
must be manually corrected. This is because the FASTDB program does not account 
for N- and C-termmal truncations of the subject sequence when calculating global 
percent identity. For subject sequences truncated at the N- and C-termmi, relative to 
the query sequence, the percent identity is corrected by calculating the number of 
residues of the query sequence that arc N- and C-terminal of the subject sequence, 
which arc not matched/aligned with a corresponding subject residue, as a percent of 
the total bases of the query sequence. Whether a residue is matched/aligned is 
determined by results of the FASTDB sequence alignment. This percentage is then 
subtracted from the percent identity, calculated by the above FASTDB program using 
the specified parameters, to arrive at a final percent identity score. This final percent 
identity score is what is used for the purposes of the present invention. Only 
residues to the N- and C- termini of the subject sequence, which are not 
matched/aligned with the query sequence, are considered for the purposes of manually 
adjusting the percent identity score. That is, only query amino acid residues outside 
the farthest N- and C-terminal residues of the subject sequence. 
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For example, a 90 amino acid residue subject sequence is aligned with a 100 
residue query sequence to determine percent identity. The deletion occurs at the N- 
terminus of the subject sequence and therefore, the FASTDB alignment does not 
match/align with the first 10 residues at the N-terminus. The 10 unpaired residues 

5 represent 10% of the sequence (number of residues at the N- and C- termini not 
matched/total number of residues in the query sequence) so 10% is subtracted from 
the percent identity score calculated by the FASTDB program, If the remaining 90 
residues were perfectly matched the Final percent identity would be 90%. In another 
example, a 90 residue subject sequence is compared with a 100 residue query 

10 sequence. This time the deletions are internal so there are no residues at the N- or C- 
termini of the subject sequence which are not matched/aligned with the query. In this 
case the percent identity calculated by FASTDB is not manually corrected. Once 
again, only residue positions outside the N- and C-terminal ends of the subject 
sequence, as displayed in the FASTDB alignment, which are not matched/aligned 

15 with the query sequence are manually corrected. No other manual corrections are to 
made for the purposes of the present invention. 

The above polypeptide sequences are included irrespective of whether they 
have their normal biological activity. This is because even where a particular 
polypeptide molecule does not have biological activity, one of skill in the art would 

20 still know how to use the polypeptide, for instance, as a vaccine or to generate 

antibodies. Other uses of the polypeptides of the present invention that do not have 
E.faecalis activity include, inter alia, as epitope tags, in epitope mapping, and as 
molecular weight markers on SDS-PAGE gels or on molecular sieve gel filtration 
columns using methods known to those of skill in the art. 

25 As described below, the polypeptides of the present invention can also be 

used to raise polyclonal and monoclonal antibodies, which are useful in assays for 
detecting E.faecalis protein expression or as agonists and antagonists capable of 
enhancing or inhibiting E. faecalis protein function. Further, such polypeptides can be 
used in the yeast two-hybrid system to "capture" E. faecalis protein binding proteins 



WO 98/50554 



PCT/US98, 08959 



which are also candidate agonists and antagonists according to the present invention. 
See, e.g., Fields ct al. i 19S9) Nature 340:245-246. 



Epitope-Bearing Portions 

ln another aspect, the invention provides peptides and polypeptides 
composing epitope-bearing portions of the E. faccalis polypeptides of the present 
invention. These epitopes are immunogenic or antigenic epitopes of the polypeptides 
of the present invention. An "immunogenic epitope" is defined as a part of a protein 
that elicits an antibody response when the whole protein or polypeptide is the 
0 immunogen. These immunogenic epitopes are believed to be confined to a few loci on 
the molecule. On the other hand, a region of a protein molecule to which an antibody 
can bind is defined as an "antigenic determinant" or "antigenic epitope." The number 
of immunogenic epitopes of a protein generally is less than the number of antigenic 
epitopes. See, e.g., Geysen, et al. (1983) Proc. Natl. Acad. Sci. USA 81:3998- 4002. 
Predicted antigenic epitopes are shown in Table 4, below. It is pointed out that Table 
4 only lists amino acid residues comprising epitopes predicted to have the highest 
degree of antigenicity. The polypeptides not listed in Table 4 and portions of 
polypeptides not listed in Table 4 are not considered non-antigenic. This is because 
they may still be antigenic in vivo but merely not recognized as such by the particular 
20 algorithm used. Thus, Table 4 lists the amino acid residues comprising preferred 
antigenic epitopes but not a complete list. Amino acid residues comprising other 
anigenic epitopes may be determined by algorithms similar to the Jameson-Wolf 
analysis or by in vivo testing for an antigenic response using the methods described 
herein or those known in the art. 
25 As t0 the selection of peptides or polypeptides bearing an antigenic epitope 

(i.e., that contain a region of a protein molecule to which an antibody can bind), it is 
well known in that art that relatively short synthetic peptides that mimic part of a 
protein sequence are routinely capable of eliciting an antiserum that reacts with the 
partially mimicked protein. See, e.g., Sutcliffe, et al., (1983) Science 219:660-666. 



15 
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Peptides capable of eliciting protein-reactive sera are frequently represented in the 
primary sequence of a protein, can be characterized by a set of simple chemical rules, 
and are confined neither to immunodominant regions of intact proteins (i.e., 
immunogenic epitopes) nor to the amino or carboxyl terminals. Peptides that are 
5 extremely hydrophobic and those of six or fewer residues generally are ineffective at 
inducing antibodies that bind to the mimicked protein; longer, peptides, especially 
those containing proline residues, usually are effective. See, Sutcliffe, et ah, supra f p. 
661. For instance, 18 of 20 peptides designed according to these guidelines, containing 
S-39 residues covering 75% of the sequence of the influenza virus hemagglutinin HA1 

10 polypeptide chain, induced antibodies that reacted with the HA1 protein or intact 
virus; and 12/12 peptides from the MuLV polymerase and 1 8/18 from the rabies 
glycoprotein induced antibodies that precipitated the respective proteins. 

Antigenic epitope-bearing peptides and polypeptides of the invention are 
therefore useful to raise antibodies, including monoclonal antibodies, that bind 

15 specifically to a polypeptide of the invention. Thus, a high proportion of hybridomas 
obtained by fusion of spleen cells from donors immunized with an antigen 
epitope-bearing peptide generally secrete antibody reactive with the native protein. 
See Sutcliffe, et al, supra, p, 663. The antibodies raised by antigenic epitope-bearing 
peptides or polypeptides are useful to detect the mimicked protein, and antibodies to 

20 different peptides may be used for tracking the fate of various regions of a protein 
precursor which undergoes post-translational processing. The peptides and 
anti-peptide antibodies may be used in a variety of qualitative or quantitative assays 
for the mimicked protein, for instance in competition assays since it has been shown 
that even short peptides (e.g., about 9 amino acids) can bind and displace the larger 

25 peptides in immunoprecipitation assays. See, e.g., Wilson, et al., (1984) Cell 
37:767-778. The anti-peptide antibodies of the invention also are useful for 
purification of the mimicked protein, for instance, by adsorption chromatography 
using methods known in the art. 

Antigenic epitope-bearing peptides and polypeptides of the invention 
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designed according to the above guidelines preferably contain a sequence of at least 
seven, more preferably at least nine and most preferably between about 10 to about 
50 ammo acids (i.e. any integer between 7 and 50) contained within the amino acid 
sequence of a polypeptide of the invention. However, peptides or polypeptides 
5 comprising a larger portion of an amino acid sequence of a polypeptide of the 
invention, containing about 50 to about 1 00 amino acids, or any length up to and 
including the entire amino acid sequence of a polypeptide of the invention, also are 
considered epitope-bearing peptides or polypeptides of the invention and also are 
useful for inducing antibodies that react with the mimicked protein. Preferably, the 

10 amino acid sequence of the epitope-bearing peptide is selected to provide substantial 
solubility in aqueous solvents (i.e., the sequence includes relatively hydrophilic 
residues and highly hydrophobic sequences are preferably avoided); and sequences 
containing proline residues are particularly preferred. 

Non-limiting examples of antigenic polypeptides or peptides that can be used 

15 to generate an enterococcal-specific immune response or antibodies include portions of 
the amino acid sequences identified in Table 1. More specifically, Table 4 discloses a 
list of non-limiting residues that are involved in the antigenicity of the epitope-bearing 
fragments of the present invention. Therefore, the present inventions provides for 
isolatd and purified antigenic epitope-bearing fragements of the polypeptides of the 

20 present invention comprising a peptide sequences of Table 4. The antigenic epitope- 
bearing fragments comprising a peptide sequence of Table 4 preferably contain a 
sequence of at least seven, more preferably at least nine and most preferably between 
about 10 to about 50 amino acids (i.e. any integer between 7 and 50) of a polypeptide 
of the present invention. That is, included in the present invention are antigenic 

25 polypeptides between the integers of 7 and 50 amino acid in length comprising one or 
more of the sequences of Table 4. Therefore, in most cases, the polypeptides of 
Table 4 make up only a portion of the antigenic polypeptide. All combinations of 
sequences between the integers of 7 and 50 ammo acid in length comprising one or 
more of the sequences of Table 4 are included. The antigenic epitope-bearing 
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fragemcnts may be specified by cither the number of contiguous amino acid residues 
or by specific N-terminal and C-terminal positions as described above for the 
polypeptide fragemcnts of the present invention, wherein the initiation codon is 
residue 1 . Any number of the described antigenic epitope-bearing fragements of the 
5 present invention may also be excluded from the present invention in the same 
manner. 

The epitope-bearing peptides and polypeptides of the invention may be 
produced by any conventional means for making peptides or polypeptides including 
recombinant means using nucleic acid molecules of the invention. For instance, an 

10 epitope-bearing amino acid sequence of the present invention may be fused to a larger 
polypeptide which acts as a carrier during recombinant production and purification, as 
well as during immunization to produce anti-peptide antibodies. Epitope-bearing 
peptides also may be synthesized using known methods of chemical synthesis. For 
instance, Houghtcn has described a simple method for synthesis of large numbers of 

15 peptides, such as 10-20 mg of 248 different 13 residue peptides representing single 
amino acid variants of a segment of the HA1 polypeptide which were prepared and 
characterized (by ELISA-typc binding studies) in less than four weeks (Houghten, R. 
A. Proc. Natl. Acad. Sci. USA 82:5131-5135 (1985)). This "Simultaneous Multiple 
Peptide Synthesis (SMPS)" process is further described in U.S. Patent No. 4,631,21 1 

20 to Houghten and coworkers (1986). In this procedure the individual resins for the 

solid-phase synthesis of various peptides are contained in separate solvent-permeable 
packets, enabling the optimal use of the many identical repetitive steps involved in 
solid-phase methods. A completely manual procedure allows 500-1000 or more 
syntheses to be conducted simultaneously (Houghten et al. (1985) Proc. Natl. Acad. 

25 Sci. 82:5131-5135 at 5134. 

Epitope-bearing peptides and polypeptides of the invention are used to induce 
antibodies according to methods well known in the art. See, e.g., Sutcliffc, et al, 
supra;; Wilson, et al., supra;; and Bittle, et al. (1985) J, Gen. Virol. 66:2347-2354. 
Generally, animals may be immunized with free peptide; however, anti-peptide 
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antibody liter may be boosted by coupling of the peptide to a macromolecular carrier, 
such as keyhole limpet hemacyanin (KLH i or tetanus toxoid. For instance, peptides 
containing cysteine may be coupled to carrier using a linker such as 
m-malcimidobenzoyl-N-hydroxysuccinimidc ester (MBS), while other peptides may 

5 be coupled to earner using a more general linking agent such as glutaraldehyde. 
Animals such as rabbits, rats and mice are immunized with either free or 
carrier-coupled peptides, for instance, by intraperitoneal and/or intradermal injection 
of emulsions containing about 100 fjg peptide or carrier protein and Freund's adjuvant. 
Several booster injections may be needed, for instance, at intervals of about two 

10 weeks, to provide a useful titer of anti-peptide antibody which can be detected, for 
example, by EL1SA assay using free peptide adsorbed to a solid surface. The titer of 
anti-peptide antibodies in serum from an immunized animal may be increased by 
selection of anti-peptide antibodies, for instance, by adsorption to the peptide on a 
solid support and clution of the selected antibodies according to methods well known 

15 in the art. 

Immunogenic epitopc-bcaring peptides of the invention, i.e., those parts of a 
protein that elicit an antibody response when the whole protein is the immunogen, are 
identified according to methods known in the art. For instance, Geysen, et ai, supra, 
discloses a procedure for rapid concurrent synthesis on solid supports of hundreds of 

20 peptides of sufficient purity to react m an ELISA. Interaction of synthesized 

peptides with antibodies is then easily detected without removing them from the 
support. In this manner a peptide bearing an immunogenic epitope of a desired 
protein may be identified routinely by one of ordinary skill in the art. For instance, 
the immunologically important epitope in the coat protein of foot-and-mouth disease 

25 virus was located by Geysen et al supra with a resolution of seven amino acids by 
synthesis of an overlapping set of all 208 possible hexapeptides covering the entire 
213 ammo acid sequence of the protein. Then, a complete replacement set of peptides 
in which all 20 amino acids were substituted in turn at every position within the 
epitope were synthesized, and the particular ammo acids conferring specificity for the 
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reaction with antibody were determined. Thus, peptide analogs of the epitope-beanng 
peptides of the invention can be made routinely by this method. U.S. Patent No. 
4,708,781 to Gcysen (1987) further describes this method of identifying a peptide 
bearing an immunogenic epitope of a desired protein. 
5 Further still, U.S. Patent No. 5,194,392, to Geysen (1990), describes a general 

method of detecting or determining the sequence of monomers (amino acids or other 
compounds) which is a topological equivalent of the epitope (i.e., a "mimotope") 
which is complementary to a particular paratope (antigen binding site) of an antibody 
of interest. More generally, U.S. Patent No. 4,433,092, also to Geysen (1989), 

10 describes a method of detecting or determining a sequence of monomers which is a 

topographical equivalent of a ligand which is complementary to the ligand binding site 
of a particular receptor of interest. Similarly, U.S. Patent No. 5,480,971 to Houghten, 
R. A. et al. (1996) discloses linear C r C 7 -alkyl peralkylated oligopeptides and sets and 
libraries of such peptides, as well as methods for using such oligopeptide sets and 

15 libraries for determining the sequence of a peralkylated oligopeptide that preferentially 
binds to an acceptor molecule of interest. Thus, non-peptide analogs of the 
epitopc-bearing peptides of the invention also can be made routinely by these 
methods. The entire disclosure of each document cited in this section on 
"Polypeptides and Fragments" is hereby incorporated herein by reference. 

20 As one of skill in the art will appreciate, the polypeptides of the present 

invention and the epitope-bearing fragments thereof described above can be combined 
with parts of the constant domain of immunoglobulins (IgG), resulting in chimeric 
polypeptides. These fusion proteins facilitate purification and show an increased 
half-life in vivo. This has been shown, e.g., for chimeric proteins consisting of the 

25 first two domains of the human CD4-polypeptide and various domains of the 

constant regions of the heavy or light chains of mammalian immunoglobulins. (EPA 
0,394,827; Traunecker et al. (1988) Nature 331 :84-86. Fusion proteins that have a 
disulfidc-linked dimeric structure due to the IgG part can also be more efficient in 
binding and neutralizing other molecules than a monomelic E.faecalis polypeptide or 
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fragment thereof alone. Sec Fountoulakis el al. (1995) J. Biochem. 270:3958-3964. 
Nucleic acids encoding the above epitopes of E.faccalis polypeptides can also be 
recombined with a gene of interest as an epitope tag to aid in detection and 
purification of the expressed polypeptide. 

Antibodies 

E. faccalis protein-specific antibodies for use in the present invention can be 
raised against the intact E.faccalis protein or an antigenic polypeptide fragment 
thereof, which may be presented together with a earner protein, such as an albumin, to 
an animal system (such as rabbit or mouse) or. if it is long enough (at least about 25 
ammo acids), without a carrier. 

As used herein, the term "antibody" (Ab) or "monoclonal antibody" (Mab) is 
meant to include intact molecules, single chain whole antibodies, and antibody 
fragments. Antibody fragments of the present invention include Fab and F(ab')2 and 
other fragments including single-chain Fvs (scFv) and disulfide-linked Fvs (sdFv). 
Also included in the present invention are chimeric and humanized monoclonal 
antibodies and polyclonal antibodies specific for the polypeptides of the present 
invention. The antibodies of the present invention may be prepared by any of a 
variety of methods. For example, cells expressing a polypeptide of the present 
invention or an antigenic fragment thereof can be administered to an animal in order to 
induce the production of sera containing polyclonal antibodies. For example, a 
preparation of E. faecalis polypeptide or fragment thereof is prepared and purified to 
render it substantially free of natural contaminants. Such a preparation is then 
introduced into an animal in order to produce polyclonal antisera of greater specific 
activity. 

In a preferred method, the antibodies of the present invention are monoclonal 
antibodies or binding fragments thereof. Such monoclonal antibodies can be prepared 
using hybridoma technology. See, e.g., Harlow et al., ANTIBODIES: A 
LABORATORY MANUAL, (Cold Spring Harbor Laboratory Press, 2nd ed. 1988); 
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Hammcrlmg, et al., in: MONOCLONAL ANTIBODIES AND T-CELL 
HYBRJDOMAS 563-681 (Elsevier, N.Y., 1981). Fab and F(ab')2 fragments may be 
produced by proteolytic cleavage, using enzymes such as papain (to produce Fab 
fragments) or pepsin (to produce F(ab')2 fragments). Alternatively, E.faecalis 
5 polypeptide-binding fragments, chimeric, and humanized antibodies can be produced 
through the application of recombinant DNA technology or through synthetic 
chemistry using methods known in the art. 

Alternatively, additional antibodies capable of binding to the polypeptide 
antigen of the present invention may be produced in a two-step procedure through the 

10 use of anti-idiotypic antibodies. Such a method makes use of the fact that antibodies 
are themselves antigens, and that, therefore, it is possible to obtain an antibody which 
binds to a second antibody. In accordance with this method, E.faecalis 
polypeptide-specific antibodies are used to immunize an animal, preferably a mouse. 
The splcnocytes of such an animal are then used to produce hybridoma cells, and the 

15 hybridoma cells are screened to identify clones which produce an antibody whose 

ability to bind to the £. faecalis polypeptide-specific antibody can be blocked by the 
E.faecalis polypeptide antigen. Such antibodies comprise anti-idiotypic antibodies to 
the E.faecalis polypeptide-specific antibody and can be used to immunize an animal 
to induce formation of further E.faecalis polypeptide-specific antibodies. 

20 Antibodies and fragements thereof of the present invention may be described 

by the portion of a polypeptide of the present invention recognized or specifically 
bound by the antibody. Antibody binding fragements of a polypeptide of the present 
invention may be described or specified in the same manner as for polypeptide 
fragements discussed above, i.e, by N-terminal and C-terminal positions or by size in 

25 contiguous amino acid residues. Any number of antibody binding fragments, of a 
polypeptide of the present invention, specified by N-terminal and C-terminal 
positions or by size in amino acid residues, as described above, may also be excluded 
from the present invention. Therefore, the present invention includes antibodies the 
specifically bind a particuarlly discribed fragement of a polypeptide of the present 
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invention and allow? tor the exclusion of the same. 

Antibodies and fragements thereof of the present invention may also be 
described or specified in terms of their cross-reactivity. Antibodies and fragements 
that do not bind polypeptides of any other species of Enterococcus other than E. 
5 faccalis are included in the present invention. Likewise, antibodies and fragements 

that bind only species of Enterococcus. i.e. antibodies and fragements that do not bind 
bacteria from any genus other than Enterococcus , are included in the present 
invention. 

1 0 D iagn ostic A ss ays 

The present invention further relates to methods for assaying staphylococcal 
infection m an animal by detecting the expression of genes encoding staphylococcal 
polypeptides of the present invention. The methods comprise analyzing tissue or 
body fluid from the animal for Entcrococcus-specific antibodies, nucleic acids, or 

15 proteins. Analysis of nucleic acid specific to Enterococcus is assayed by PCR or 
hybridization techniques using nucleic acid sequences of the present invention as 
either hybridization probes or primers. See, e.g., Sambrook et al. Molecular cloning: 
A Laboratory Manual (Cold Spring Harbor Laboratory Press, 2nd cd., 1 989, page 54 
reference); Eremeeva et al. (1994) J. Clin. Microbiol. 32:803-810 (describing 

20 differentiation among spotted fever group Rickettsiae species by analysis of restriction 
fragment length polymorphism of PCR-amplified DNA) and Chen et al. 1994 J. Clin. 
Microbiol. 32:589-595 (detecting B. burgdorferi nucleic acids via PCR). 

Where diagnosis of a disease state related to infection with Enterococcus has 
already been made, the present invention is useful for monitoring progression or 

25 regression of the disease state whereby patients exhibiting enhanced Enterococcus 

gene expression wall experience a worse clinical outcome relative to patients expressing 
these gene(s) at a lower level. 

By "biological sample" is intended any biological sample obtained from an 
animal, cell line, tissue culture, or other source which contains Enterococcus 
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polypeptide, mRNA, or DNA. Biological samples include body fluids (such as saliva, 
blood, plasma, urine, mucus, synovial fluid, etc.) tissues (such as muscle, skin, and 
cartilage ) and any other biological source suspected of containing Enterococcus 
polypeptides or nucleic acids. Methods for obtaining biological samples such as 

5 tissue are well known in the art. 

The present invention is useful for detecting diseases related to Enterococcus 
infections in animals. Preferred animals include monkeys, apes, cats, dogs, birds, 
cows, pigs, mice, horses, rabbits and humans. Particularly preferred are humans. 
Total RNA can be isolated from a biological sample using any suitable 

10 technique such as the single-step guanidinium-thiocyanate-phenol-chloroform method 
described in Chomczynski et ah (1987) Anal. Biochem. 162:156-159. mRNA encoding 
Enterococcus polypeptides having sufficient homology to the nucleic acid sequences 
identified in Table 1 to allow for hybridization between complementary sequences are 
then assayed using any appropriate method. These include Northern blot analysis, SI 

15 nuclease mapping, the polymerase chain reaction (PCR), reverse transcription in 

combination with the polymerase chain reaction (RT-PCR), and reverse transcription 
in combination with the ligase chain reaction (RT-LCR). 

Northern blot analysis can be performed as described in Harada et ah (1990) 
Cell 63:303-3 12. Briefly, total RNA is prepared from a biological sample as described 

20 above. For the Northern blot, the RNA is denatured in an appropriate buffer (such as 
glyoxal/dimethyl sulfoxide/sodium phosphate buffer), subjected to agarose gel 
electrophoresis, and transferred onto a nitrocellulose filter. After the RNAs have been 
linked to the filter by a UV linker, the filter is prehybridized in a solution containing 
formamide, SSC, Denhardt's solution, denatured salmon sperm, SDS, and sodium 

25 phosphate buffer. A E.faecalis polynucleotide sequence shown in Table 1 labeled 
according to any appropriate method ( such as the 32 P-multiprimed DNA labeling 
system (Amersham)) is used as probe. After hybridization overnight, the filter is 
washed and exposed to x-ray film. DNA for use as probe according to the present 
invention is described in the sections above and will preferably at least 15 nucleotides 
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in length. 

SI mapping can be performed as described in Fujita ct al. (1987) Cell 
49:?5" T -?6"\ To prepare probe DNA for use in SI mapping, the sense strand of an 
above-described E. faecalis DNA sequence of the present invention is used as a 
5 template to synthesize labeled antiscnse DNA. The antisensc DNA can then be 
digested using an appropriate restriction endonuciease to generate further DNA 
probes of a desired length. Such antiscnse probes are useful for visualizing protected 
bands corresponding to the target mRNA (i.e.. mRNA encoding Enierococcus 
polypeptides). 

10 Levels of mRNA encoding Enierococcus polypeptides are assayed, for e.g., 

using the RT-PCR method described in Makino et ah (1990) Technique 2:295-301. 
By this method, the radioactivities of die "amplicons" in the polyacrylamide gel bands 
are linearly related to the initial concentration of the target mRNA. Briefly, this 
method involves adding total RNA isolated from a biological sample in a reaction 
15 mixture containing a RT primer and appropriate buffer. After incubating for primer 
annealing, the mixture can be supplemented with a RT buffer, dNTPs, DTT, RNase 
inhibitor and reverse transcriptase. After incubation to achieve reverse transcription 
of the RNA, the RT products are then subject to PCR using labeled primers. 
Alternatively, rather than labeling the primers, a labeled dNTP can be included in the 
20 PCR reaction mixture. PCR amplification can be performed in a DNA thermal cycler 
according to conventional techniques. After a suitable number of rounds to achieve 
amplification, the PCR reaction mixture is electrophoresed on a polyacrylamide gel. 
After drying the gel, the radioactivity of the appropriate bands (corresponding to the 
mRNA encoding the Enierococcus polypeptides of the present invention) are 
25 quantified using an imaging analyzer. RT and PCR reaction ingredients and 

conditions, reagent and gel concentrations, and labeling methods are well known in the 
art. Variations on the RT-PCR method will be apparent to the skilled artisan. Other 
PCR methods that can detect the nucleic acid of the present invention can be found in 
PCR PRIMER; A LABORATORY MANUAL (CAV. Dieffenbach et al. eds., Cold 
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Spring Harbor Lab Press, 1995). 

The polynucleotides of the present invention, including both DNA and RNA, 
may be used to detect polynucleotides of the present invention or Enterococcal 
species including E.faecalis using bio chip technology. The present invention 

5 includes both high density chip arrays (> 1 000 oligonucleotides per cm 2 ) and low 

density chip arrays (< 1 000 oligonucleotides per cm 2 ). Bio chips comprising arrays of 
polynucleotides of the present invention may be used to detect Enterococcal species, 
including E.faecalis, in biological and environmental samples and to diagnose an 
animal, including humans, with an E.faecalis or other Enterococcal infection. The bio 

10 chips of the present invention may comprise polynucleotide sequences of other 

pathogens including bacteria, viral, parasitic, and fungal polynucleotide sequences, in 
addition to the polynucleotide sequences of the present invention, for use in rapid 
diffenertial pathogenic detection and diagnosis. The bio chips can also be used to 
monitor an E.faecalis or other Enterococcal infections and to monitor the genetic 

15 changes (deletions, insertions, mismatches, etc.) in response to drug therapy in the 
clinic and drug development in the laboratory. The bio chip technology comprising 
arrays of polynucleotides of the present invention may also be used to simultaneously 
monitor the expression of a multiplicity of genes, including those of the present 
invention. The polynucleotides used to comprise a selected array may be specified in 

20 the same manner as for the fragements, i.e, by their 5' and 3' positions or length in 
contigious base pairs and include from. Methods and particular uses of the 
polynucleotides of the present invention to detect Enterococcal species, including E. 
faecalis, using bio chip technology include those known in the art and those of: U.S. 
Patent Nos. 5510270, 5545531, 5445934,5677195, 5532128, 5556752, 5527681, 

25 5451683, 5424186, 5607646, 5658732 and World Patent Nos. WO/9710365, 
WO/95 1 1995, WO/9743447, WO/9535505, each incorporated herein in their 
entireties. 

Biosensors using the polynucleotides of the present invention may also be 
used to detect, diagnose, and monitor E. faecalis or other Enterococcal species and 
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infections thereof. Biosensors using the polynucleotides of the present invention mav 
also be used to deteet particular polynucleotides of the present invention. Biosensors 
using the polynucleotides of the present invention may also be used to monitor the 
genetic changes (deletions, insertions, mismatches, etc. ) in response to drug therapy in 
f the clinic and drug development in the laboratory. Methods and particular uses of the 
polynucleotides of the present invention to detect Enterococcai species, including E. 
taecalis. using bioscnors include those known in the art and those of: U.S. Patent Nos 
5721 102. 5658732. 5631 170, and World Patent Nos. WO97/3501 1, WO/9720203, 
each incorporated herein in their entireties. 

10 Thus, the present invention includes both bio chips and biosensors comprising 

polynucleotides of the present invention and methods of their use. 

Assaying Enterococcus polypeptide levels in a biological sample can occur 
using any art-known method, such as antibody-based techniques. For example, 
Enterococcus polypeptide expression in tissues can be studied with classical 

15 inimunohistological methods. In these, the specific recognition is provided by the 

primary antibody (polyclonal or monoclonal) but the secondary detection system can 
utilize fluorescent, enzyme, or other conjugated secondary antibodies. As a result, an 
immunohistological staining of tissue section for pathological examination is obtained. 
Tissues can also be extracted, e.g., with urea and neutral detergent, for the liberation of 

20 Enterococcus polypeptides for Wcstcrn-blot or dot/slot assay. See, e.g., Jalkanen, M. 
et al. (1985) J. Cell. Biol. 101:976-985; Jalkanen, M. et ah (1987) J. Cell . Biol. 
105:3087-3096. In this technique, which is based on the use of cationic solid phases, 
quantitation of a Enterococcus polypeptide can be accomplished using an isolated 
Enterococcus polypeptide as a standard. This technique can also be applied to body 

25 fluids. 

Other antibody-based methods useful for detecting Enterococcus polypeptide 
gene expression include immunoassays, such as the ELISA and the radioimmunoassay 
(RIA). For example, a Enterococcus polypeptide-specific monoclonal antibodies can 
be used both as an immunoabsorbent and as an enzyme-labeled probe to detect and 
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quantify a Enterococcus polypeptide. The amount of a Enterococcus polypeptide 
present in the sample can be calculated by reference to the amount present in a 
standard preparation using a linear regression computer algorithm. Such an ELISA is 
described in lacobelli et al. (1988) Breast Cancer Research and Treatment 1 1:19-30. In 
5 another ELISA assay, two distinct specific monoclonal antibodies can be used to 

detect Enterococcus polypeptides in a body fluid. In this assay, one of the antibodies 
is used as the immunoabsorbent and the other as the enzyme-labeled probe. 

The above techniques may be conducted essentially as a "one-step" or 
"two-step" assay. The "one-step" assay involves contacting the Enterococcus 
10 polypeptide with immobilized antibody and, without washing, contacting the mixture 
with the labeled antibody. The "two-step" assay involves washing before contacting 
the mixture with the labeled antibody. Other conventional methods may also be 
employed as suitable. It is usually desirable to immobilize one component of the 
assay system on a support, thereby allowing other components of the system to be 
15 brought into contact with the component and readily removed from the sample. 
Variations of the above and other immunological methods included in the present 
invention can also be found m Harlow et al., ANTIBODIES: A LABORATORY 
MANUAL, (Cold Spring Harbor Laboratory Press, 2nd ed. 1988). 

Suitable enzyme labels include, for example, those from the oxidase group, 
20 which catalyze the production of hydrogen peroxide by reacting with substrate. 
Glucose oxidase is particularly preferred as it has good stability and its substrate 
(glucose) is readily available. Activity of an oxidase label may be assayed by 
measuring the concentration of hydrogen peroxide formed by the enzyme-labeled 
antibody/substrate reaction. Besides enzymes, other suitable labels include 
25 radioisotopes, such as iodine ( 125 I, 121 1), carbon ( 14 C), sulphur ( 35 S ), tritium ( 3 H), 

indium ( 112 In), and technetium ( 99m Tc), and fluorescent labels, such as fluorescein and 
rhodamine, and biotin. 

Further suitable labels for the Enterococcus polypeptide-specific antibodies of 
the present invention are provided below. Examples of suitable enzyme labels include 
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malate dehydrogenase, Enteroeoccal nuclease, delta-5-steroid isomerase, yeast-alcohol 
dehydrogenase, alpha-glycerol phosphate dehydrogenase, triose phosphate isomerase, 
peroxidase, alkaline phosphatase, asparaginase, glucose oxidase, beta-galactosidase, 
ribonuclease, urease, catalase, glucose-6-phosphate dehydrogenase, glucoamylase, and 
5 acetylcholine esterase. 

Examples of suitable radioisotopic labels include 3 H, 1H ln, 125 I, l3l I, 32 P, 35 S, 
i4 C, 51 Cr, 5? To, "Co, 59 Fe, 75 Se, ]52 Eu, 9U Y, 67 Cu, 217 Ci, 211 At, 212 Pb, 47 Sc, 109 Pd, etc. 
111 In is a preferred isotope where in vivo imaging is used since its avoids the problem 
of dehalogenation of the 125 1 or 131 I-labeled monoclonal antibody by the liver. In 
10 addition, this radionucleotide has a more favorable gamma emission energy for imaging. 
See, e.g., Perkins el al. (1985) Eur. J. Nucl. Med, 10:296-301; Carasquillo et al. 
(1987) J, Nucl. Med. 28:281-287. For example, lu In coupled to monoclonal 
antibodies with 1 -(P-isothiocyanatobenzyl)-DPTA has shown little uptake in 
non-tumors tissues, particularly the liver, and therefore enhances specificity of tumor 
15 localization. See, Esteban et al. (1987) J. Nucl. Med. 28:861-870. 

Examples of suitable non-radioactive isotopic labels include ,57 Gd, 55 Mn, 
162 Dy, 52 Tr, and 56 Fe. 

Examples of suitable fluorescent labels include an 152 Eu label, a fluorescein 
label, an isothiocyanate label a rhodamme label, a phycoerythrin label, a phycocyanin 
20 label, an allophycocyanin label, an o-phthaldehyde label, and a fluorescamine label. 

Examples of suitable toxin labels include, Pseudomonas toxin, diphtheria toxin, 
ricin, and cholera toxin. 

Examples of chcmiluminescent labels include a luminal label, an isoluminal 
label, an aromatic acridinium ester label, an imidazole label, an acridinium salt label, an 
25 oxalate ester label, a luciferin label, a luciferase label, and an aequorin label. 

Examples of nuclear magnetic resonance contrasting agents include heavy metal 
nuclei such as Gd, Mn, and iron. 

Typical techniques for binding the above-described labels to antibodies are 
provided by Kennedy et al (1976) Clin. Chim. Acta 70:1-31, and Schurs et al (1977) 
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Clin. Chini. Acta 81 : 1-40. Coupling techniques mentioned in the latter are the 
glutaraldehvde method, the penodate method, the dimaleimide method, the 
nvnuilennidobenzyl^N-hydroxy-succmimide ester method, all of which methods are 
incorporated by reference herein. 

5 In a related aspect, the invention includes a diagnostic kit for use in screening 

scrum containing antibodies specific against E.faecalis infection. Such a kit may 
include an isolated E.faecalis antigen comprising an epitope which is specifically 
immunoreactive with at least one nnii-E. faecalis antibody. Such a kit also includes 
means for detecting the binding of said antibody to the antigen. In specific 

10 embodiments, the kit may include a recombinantly produced or chemically 

synthesized peptide or polypeptide antigen. The peptide or polypeptide antigen 
may be attached to a solid support. 

In a more specific embodiment, the detecting means of the above-descnbed kit 
includes a solid support to which said peptide or polypeptide antigen is attached. 

1 5 Such a kit may also include a non-attached reporter-labeled anti-human antibody. In 
this embodiment, binding of the antibody to the E.faecalis antigen can be detected by 
binding of the reporter labeled antibody to the anti-£. faecalis polypeptide antibody, 

In a related aspect, the invention includes a method of detecting E. faecalis 
infection in a subject. This detection method includes reacting a body fluid, preferably 

20 serum, from the subject with an isolated E.faecalis antigen, and examining the antigen 
for the presence of bound antibody. In a specific embodiment, the method includes a 
polypeptide antigen attached to a solid support, and serum is reacted with the 
support. Subsequently, the support is reacted with a reporter-labeled anti-human 
antibody. The support is then examined for the presence of reporter-labeled 

25 antibody. 

The solid surface reagent employed in the above assays and kits is prepared 
by known techniques for attaching protein material to solid support material, such as 
polymeric beads, dip sticks, 96-well plates or filter material. These attachment 
methods generally include non-specific adsorption of the protein to the support or 



WO 98/50554 



-51- 



rCT7US98/08959 



covalcnt attachment of the protein , typically through a free amine group, to a 
chemically reactive group on the solid support, such as an activated carboxyl, 
hydroxyL or aldehyde group. Alternatively, streptavidin coated plates can be used in 
conjunction with biotinylated antigen(s). 
5 The polypeptides and antibodies of the present invention, including fragments 

thereof, may be used to detect Entcrococcal species including E.faecalis using bio chip 
and biosensor technology. Bio chip and biosensors of the present invention may 
comprise the polypeptides of the present invention to detect antibodies, which 
specifically recognize Entcrococcal species, including E.faecalis. Bio chip and 
10 biosensors of the present invention may also comprise antibodies which specifically 
recognize the polypeptides of the present invention to detect Entcrococcal species, 
including E. faecalis or specific polypeptides of the present invention. Bio chips or 
biosensors comprising polypeptides or antibodies of the present invention may be 
used to detect Entcrococcal species, including E.faecalis, in biological and 
15 environmental samples and to diagnose an animal, including humans, with an E. 

faecalis or other Enterococcal infection. Thus, the present invention includes both bio 
chips and biosensors comprising polypeptides or antibodies of the present invention 
and methods of their use. 

The bio chips of the present invention may further comprise polypeptide 
20 sequences of other pathogens including bacteria, viral, parasitic, and fungal 

polypeptide sequences, in addition to the polypeptide sequences of the present 
invention, for use in rapid diffenertial pathogenic detection and diagnosis. The bio 
chips of the present invention may further comprise antibodies or fragements thereof 
specific for other pathogens including bacteria, viral, parasitic, and fungal polypeptide 
25 sequences, in addition to the antibodies or fragements thereof of the present invention, 
for use in rapid diffenertial pathogenic detection and diagnosis. The bio chips and 
biosensors of the present invention may also be used to monitor an E.faecalis or other 
Enterococcal infection and to monitor the genetic changes (amio acid deletions, 
insertions, substitutions, etc.) in response to drug therapy in the clinic and drug 
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development in the laboratory. The bio chip and biosensors comprising polypeptides 
or antibodies of the present invention may also be used to simultaneously monitor the 
expression of a multiplicity of polypeptides, including those of the present invention. 
The polypeptides used to comprise a bio chip or biosensor of the present invention 

5 may be specified in the same manner as for the fragemcnts. i.e. by their N -terminal and 
C-terminal positions or length in conligious ammo acid residue. Methods and 
particular uses of the polypeptides and antibodies of the present invention to detect 
Enterococcal species, including E.faecalis* or specific polypeptides using bio chip and 
biosensor technology include those known in the art, those of the U.S. Patent Nos. 

10 and World Patent Nos. listed above for bio chips and biosensors using 

polynucleotides of the present invention, and those of: U.S. Patent Nos. 5658732, 
5135852, 5567301, 5677196, 5690894 and World Patent Nos. W09729366, 
W096 12957, each incorporated herein in their entireties. 

1 5 Treatment: 

Agonists and Antagonists - Assays and Molecules 

The invention also provides a method of screening compounds to identify 
those which enhance or block the biological activity of the E.faecalis polypeptides of 
the present invention. The present invention further provides where the compounds 
20 kill or slow the growth of E. faecalis. The ability of E. faecalis antagonists, including 
E.faecalis ligands, to prophylactically or therapeutically block antibiotic resistance 
may be easily tested by the skilled artisan. See, e.g., Straden et al. (1997) J Bacterid. 
179U):9-16. 

An agonist is a compound which increases the natural biological function or 
25 which functions in a manner similar to the polypeptides of the present invention, 

while antagonists decrease or eliminate such functions. Potential antagonists include 
small organic molecules, peptides, polypeptides, and antibodies that bind to a 
polypeptide of the invention and thereby inhibit or extinguish its activity. 

The antagonists may be employed for instance to inhibit peptidoglycan cross 
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bndgc formation. Antibodies against E.faecalis may be employed to bind to and 
inhibit E. faecalis activity to treat antibiotic resistance. Any of the above antagonists 
may be employed in a composition with a pharmaceutical^ acceptable carrier. 

5 Vaccines 

The present invention also provides vaccines comprising one or more 

polypeptides of the present invention. Heterogeneity in the composition of a vaccine 

may be provided by combining E.faecalis polypeptides of the present invention. 

Multi-component vaccines of this type are desirable because they are likely to be 
10 more effective in eliciting protective immune responses against multiple species and 

strains of the Entcrococcus genus than single polypeptide vaccines. 

Multi-component vaccines are known in the art to elicit antibody production 

to numerous immunogenic components. See, e.g., Decker et al. (1996) J. Infect. Dis. 

174:S270-275. In addition, a hepatitis B, diphtheria, tetanus, pertussis tetravalent 
1 5 vaccine has recently been demonstrated to elicit protective levels of antibodies in 

human infants against all four pathogenic agents. See, e.g., Aristegui, J. et al. (1997) 

Vaccine 15:7-9. 

The present invention in addition to single-component vaccines includes 
multi-component vaccines. These vaccines comprise more than one polypeptide, 
20 immunogen or antigen. Thus, a multi-component vaccine would be a vaccine 

comprising more than one of the E.faecalis polypeptides of the present invention. 

Further within the scope of the invention are whole cell and whole viral 
vaccines. Such vaccines may be produced recombinantly and involve the expression 
of one or more of the E. faecalis polypeptides described in Table 1 . For example, the 
25 E, faecalis polypeptides of the present invention may be either secreted or localized 
intracellular, on die cell surface, or in the periplasmic space. Further, when a 
recombinant virus is used, the E.faecalis polypeptides of the present invention may, 
for example, be localized in the viral envelope, on the surface of the capsid, or 
internally within the capsid. Whole cells vaccines which employ cells expressing 
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heterologous proteins are known in the art. See, e.g., Robinson. K. et al. (1997) 
Nature Biotech. 15:653-657; Sirard. J. et al. (199") Infect. Immun. 65:2029-2033; 
Chabalgoity, J. et al. (1997) Infect. Immun. 65:2402-2412 . These cells may be 
administered live or may be killed prior to administration. Chabalgoity, J. et al., supra, 
5 for example, report the successful use in mice of a live attenuated Salmonella vaccine 
strain which expresses a portion of a platyhclminth fatty acid-binding protein as a 
fusion protein on its cells surface. 

A multi-component vaccine can also be prepared using techniques known in 
the art by combining one or more E. faecalis polypeptides of the present invention, or 
10 fragments thereof, with additional non-Enterococcal components (e.g., diphtheria 

toxin or tetanus toxin, and/or other compounds known to elicit an immune response). 
Such \accines are useful for eliciting protective immune responses to both members of 
the Enterococcus genus and non-Enterococcal pathogenic agents. 

The vaccines of the present invention also include DNA vaccines. DNA 
15 vaccines are currently being developed for a number of infectious diseases. See, et al, 
Boyer, et al. (1997) Nat. Med. 3:526-532; reviewed in Spier, R. (1996) Vaccine 
14:1285-1288. Such DNA vaccines contain a nucleotide sequence encoding one or 
more E. faecalis polypeptides of the present invention oriented in a manner that 
allows for expression of the subject polypeptide. For example, the direct 
20 administration of plasmid DNA encoding B. burgdorgeri OspA has been shown to 
elicit protective immunity in mice against borrelial challenge. See, Luke et al. (1997) J. 
Infect. Dis. 175:91-97. 

The present invention also relates to the administration of a vaccine which is 
co-administered with a molecule capable of modulating immune responses. Kim et al. 
25 (1997) Nature Biotech. 15:641-646, for example, report the enhancement of immune 
responses produced by DNA immunizations when DNA sequences encoding 
molecules which stimulate the immune response are co-administered. In a similar 
fashion, the vaccines of the present invention may be co-administcred with either 
nucleic acids encoding immune modulators or the immune modulators themselves. 
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These immune modulators include granulocyte macrophage colony stimulating factor 
(GM-CSF) and CD86. 

The vaccines of the present invention may be used to confer resistance to 
Enterococcal infection by either passive or active immunization. When the vaccines of 
5 the present invention are used to confer resistance to Enterococcal infection through 
active immunization, a vaccine of the present invention is administered to an animal to 
elicit a protective immune response which either prevents or attenuates a Enterococcal 
infection. When the vaccines of the present invention are used to confer resistance to 
Enterococcal infection through passive immunization, the vaccine is provided to a host 
10 animal (e.g. , human, dog, or mouse), and the antisera elicited by this antisera is 

recovered and directly provided to a recipient suspected of having an infection caused 
by a member of the Enterococcus genus. 

The ability to label antibodies, or fragments of antibodies, with toxin molecules 
provides an additional method for treating Enterococcal infections when passive 
15 immunization is conducted. In this embodiment, antibodies, or fragments of 

antibodies, capable of recognizing the E.faecalis polypeptides disclosed herein, or 
fragments thereof, as well as other Enterococcus proteins, are labeled with toxin 
molecules prior to their administration to the patient. When such toxin derivatized 
antibodies bind to Enterococcus cells, toxin moieties will be localized to these cells and 
20 will cause their death. 

The present invention thus concerns and provides a means for preventing or 
attenuating a Enterococcal infection resulting from organisms which have antigens that 
are recognized and bound by antisera produced in response to the polypeptides of the 
present invention. As used herein, a vaccine is said to prevent or attenuate a disease if 
25 its administration to an animal results either in the total or partial attenuation (i.e., 
suppression) of a symptom or condition of the disease, or in the total or partial 
immunity of the animal to the disease. 

The administration of the vaccine (or the antisera which it elicits) may be for 
either a "prophylactic' 1 or "therapeutic" purpose. When provided prophylactically, 
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the compound! s") arc provided in advance of any symptoms of Enterococcal infection. 
The prophylactic administration of the compounds s) serves to prevent or attenuate 
any subsequent infection. When provided therapeutically, the compound(s) is 
provided upon or after the detection of symptoms which indicate that an animal may 

5 be infected with a member of the Enwrucoccus genus. The therapeutic administration 
of the compound(s) serves to attenuate any actual infection. Thus, the E. faecalis 
polypeptides, and fragments thereof, of the present invention may be provided either 
prior to the onset of infection (so as to prevent or attenuate an anticipated infection) 
or after the initiation of an actual infection. 

10 The polypeptides of the invention, whether encoding a portion of a native 

protein or a functional derivative thereof, may be administered in pure form or may be 
coupled to a macromolecular carrier. Example of such carriers are proteins and 
carbohydrates. Suitable proteins which may act as macromolecular carrier for 
enhancing the immunogenicity of the polypeptides of the present invention include 

15 keyhole limpet hemacyanin (KLH) tetanus toxoid, pertussis toxin, bovine serum 
albumin, and ovalbumin. Methods for coupling the polypeptides of the present 
invention to such macromolecular carriers are disclosed in Harlow et al., 
ANTIBODIES: A LABORATORY MANUAL, (Cold Spring Harbor Laboratory 
Press, 2nd ed. 1988). 

20 A composition is said to be "pharmacologically or physiologically acceptable" 

if its administration can be tolerated by a recipient animal and is otherwise suitable for 
administration to that animal Such an agent is said to be administered in a 
"therapeutically effective amount" if the amount administered is physiologically 
significant. An agent is physiologically significant if its presence results in a 

25 detectable change in the physiology of a recipient patient. 

While in all instances the vaccine of the present invention is administered as a 
pharmacologically acceptable compound, one skilled in the art would recognize that 
the composition of a pharmacologically acceptable compound varies with the animal 
to which it is administered. For example, a vaccine intended for human use will 
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generally not be co-administered with Freund's adjuvant. Further, the level of purity 
of the E.faecalis polypeptides of the present invention will normally be higher when 
administered to a human than when administered to anon-human animal. 

As would be understood by one of ordinary skill in the art, when the vaccine 

5 of the present invention is provided to an animal, it may be in a composition which 
may contain salts, buffers, adjuvants, or other substances which are desirable for 
improving the efficacy of the composition. Adjuvants are substances that can be used 
to specifically augment a specific immune response. These substances generally 
perform two functions: (1) they protect the antigen(s) from being rapidly catabolized 

10 after administration and (2) they nonspecifically stimulate immune responses. 

Normally, the adjuvant and the composition are mixed prior to presentation to 
the immune system, or presented separately, but into the same site of the animal being 
immunized. Adjuvants can be loosely divided into several groups based upon their 
composition. These groups include oil adjuvants (for example, Freund's complete and 

15 incomplete), mineral salts (for example, A1K(S0 4 ) 2 , AlNa(S0 4 ) 2 , A1NH 4 (S0 4 ), silica, 
kaolin, and carbon), polynucleotides (for example, poly IC and poly AU acids), and 
certain natural substances (for example, wax D from Mycobacterium tuberculosis, as 
well as substances found in Cory neb acterium parvum, or Bordetella pertussis, and 
members of the genus Brucella. Other substances useful as adjuvants are the saponins 

20 such as, for example, Quil A. (Superfos A/S, Denmark). Preferred adjuvants for use in 
the present invention include aluminum salts, such as A1K(S0 4 ) 2 , AlNa(S0 4 ) 2 , and 
A1NH 4 (S0 4 ). Examples of materials suitable for use in vaccine compositions are 
provided in REMINGTON'S PHARMACEUTICAL SCIENCES 1324-1341 (A. 
Osol, ed, Mack Publishing Co, Easton, PA, (1980) (incorporated herein by reference). 

25 The therapeutic compositions of the present invention can be administered 

parenterally by injection, rapid infusion, nasopharyngeal absorption 
(intranasopharangeally ), dermoabsorption, or orally. The compositions may 
alternatively be administered intramuscularly, or intravenously. Compositions for 
parenteral administration include sterile aqueous or non-aqueous solutions, 
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suspensions, and emulsions. Examples of non-aqueous solvents are propylene glycol, 
polyethylene glycol, vegetable oils such as olive oil, and injectable organic esters such 
as ethyl oleatc. Carriers or occlusive dressings can be used to increase skin 
permeability and enhance antigen absorption. Liquid dosage forms for oral 

5 administration may generally comprise a liposome solution containing the liquid 
dosage form. Suitable forms for suspending liposomes include emulsions, suspen- 
sions, solutions, syrups, and elixirs containing inert diluents commonly used in the art, 
such as purified water. Besides the inert diluents, such compositions can also include 
adjuvants, w etting agents, emulsifying and suspending agents, or sweetening, 

1 o flavoring, or perfuming agents. 

Therapeutic compositions of the present invention can also be administered in 
encapsulated form. For example, intranasal immunization using vaccines encapsulated 
in biodegradable microsphere composed of poly(DL-lactide-co-glycolide). See, 
Shahin, R. et al. (1995) Infect. Immun. 63:1 195-1200. Similarly, orally administered 

15 encapsulated Salmonella typhimurium antigens can also be used. Allaoui-Attarki, K. 
et al. (1997) Infect. Immun. 65:853-857. Encapsulated vaccines of the present 
invention can be administered by a variety of routes including those involving 
contacting the vaccine with mucous membranes (e.g., intranasally, intracolonicly, 
intraduodenally). 

20 Many different techniques exist for the timing of the immunizations when a 

multiple administration regimen is utilized. It is possible to use the compositions of 
the invention more than once to increase the levels and diversities of expression of the 
immunoglobulin repertoire expressed by the immunized animal. Typically, if multiple 
immunizations are given, they will be given one to two months apart. 

25 According to the present invention, an ''effective amount" of a therapeutic 

composition is one which is sufficient to achieve a desired biological effect. Generally, 
the dosage needed to provide an effective amount of the composition will vary 
depending upon such factors as the animal's or human's age, condition, sex, and extent 
of disease, if any, and other variables which can be adjusted by one of ordinary skill in 
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the art. 

The antigenic preparations of the invention can be administered by either 
single or multiple dosages of an effective amount. Effective amounts of the 
compositions of the invention can vary from 0.01-1 ,000 [ig/m\ per dose, more 
5 preferably 0.1-500 |ig ; ml per dose, and most preferably 10-300 jig/ml per dose. 

Examples 

Example 1: Isolation of a Selected DNA Clone From the Deposited Sample ofE. 
faecalis 

10 Three approaches can be used to isolate a E. faecalis clone comprising a 

polynucleotide of the present invention from any E. faecalis genomic DNA library. 
The E. faecalis strain V586 has been deposited as a convienent source for obtaining a 
E. faecalis strain although a wide varity of strains E. faecalis strains can be used which 
are known in the art. 

15 E. faecalis genomic DNA is prepared using the following method. A 20ml 

overnight bacterial culture grown in a rich medium (e.g., Trypticase Soy Broth, Brain 
Heart Infusion broth or Super broth), pelleted, washed two times with TES (30mM 
Tris-pH 8.0, 25mM EDTA, 50mM NaCl), and resuspended in 5ml high salt TES 
(2.5M NaCV). Lysostaphin is added to final concentration of approx 50ug/ml and the 

20 mixture is rotated slowly 1 hour at 37C to make protoplast cells. The solution is then 
placed in incubator (or place in a shaking water bath) and warmed to 55C. Five 
hundred micro liter of 20% sarcosyl in TES (final concentration 2%) is then added to 
lyse the cells. Next, guanidine HC1 is added to a final concentration of 7M (3.69g in 
5.5 ml). The mixture is swirled slowly at 55C for 60-90 rnin (solution should clear). 

25 A CsCl gradient is then set up in SW41 ultra clear tubes using 2.0ml 5.7M CsCl and 
overlaying with 2.85M CsCl. The gradient is carefully overlayed with the DNA- 
containing GuHCl solution. The gradient is spun at 30,000 rprn, 20C for 24 hr and 
the lower DNA band is collected. The volume is increased to 5 ml with TE buffer. 
The DNA is then treated with protease K (10 ug/ml) overnight at 37 C, and 
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precipitated with ethanol. The precipitated DNA is resuspended in a desired buffer. 

In the first method, a plasmid is directly isolated by screening a plasmid E. 
fat'cuiis genomic DNA library using a polynucleotide probe corresponding to a 
polynucleotide of the present invention. Particularly, a specific polynucleotide with 
5 30-40 nucleotides is synthesized using an Applied Biosystems DNA synthesizer 
according to the sequence reported. The oligonucleotide is labeled, for instance, with 
~T-y-ATP using T4 polynucleotide kinase and purified according to routine methods. 
(Sec, e.g.. Maniatis et aL Molecular Cloning: A Laboratory Manual Cold Spnng 
Harbor Press. Cold Spring, NY (1982).) The library is transformed into a suitable 

10 host, as indicated above (such as XL-1 Blue (Stratagene)) using techniques known to 
those of skill in the art. Sec, e.g., Sambrook et al. MOLECULAR CLONING: A 
LABORATORY MANUAL (Cold Spring Harbor, N.Y. 2nd ed. 1989); Ausubel et ah, 
CURRENT PROTOCALS IN MOLECULAR BIOLOGY (John Wiley and Sons, 
N.Y. 1989). The transformants are plated on 1.5% agar plates (containing the 

15 appropriate selection agent, e.g., ampicillin) to a density of about 150 transformants 
(colonies) per plate. These plates are screened using Nylon membranes according to 
routine methods for bacterial colony screening. See, e.g., Sambrook et al. 
MOLECULAR CLONING: A LABORATORY MANUAL (Cold Spring Harbor, 
N.Y. 2nd ed. 1989); Ausubel et al, CURRENT PROTOCALS IN MOLECULAR 

20 BIOLOGY (John Wiley and Sons, N.Y. 1989) or other techniques known to those of 
skill in the art. 

Alternatively, two primers of 15-25 nucleotides derived from the 5' and 3' ends 
of a polynucleotide of Table 1 are synthesized and used to amplify the desired DNA 
by PCR using a E. faecalis genomic DNA prep as a template. PCR is carried out 
25 under routine conditions, for instance, in 25 |il of reaction mixture with 0.5 ug of the 
above DNA template. A convenient reaction mixture is 1.5-5 mM MgCl 2 , 0.01% 
(w/v) gelatin, 20 ^iM each of dATP, dCTP, dGTP, dTTP, 25 pmol of each primer and 
0.25 Unit of Taq polymerase. Thirty five cycles of PCR (denaturation at 94°C for 1 
min; annealing at 55°C for 1 min; elongation at 72°C for 1 min) are performed with a 
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Pcrkin-Elmer Cctus automated thermal cycler. The amplified product is analyzed by 
agarose gel electrophoresis and the DNA band with expected molecular weight is 
excised and purified. The PCR product is verified to be the selected sequence by 
subclonmg and sequencing the DNA product. 
5 Finally, overlapping oligos of the DMA sequences of Table 1 can be chemically 

synthesized and used to generate a nucleotide sequence of desired length using PCR 
methods known in the art. 

Example 2(a): Expression and Purification Enterococcal polypeptides in E. coli 
10 The bacterial expression vector pQE60 was used for bacterial expression of 

some of the polypeptide fragements used in the soft tissue and systemic infection 
models discussed below. (QIAGEN, Inc., 9259 Eton Avenue, Chatsworth, CA, 
91311). pQE60 encodes ampicillin antibiotic resistance ("Ampr") and contains a 
bacterial origin of replication ("ori"), an 1PTG inducible promoter, a ribosome binding 
15 site ("RBS"), six codons encoding histidme residues that allow affinity purification 
using nickel-nitrilo-tri-acetic acid ("Ni-NTA") affinity resin (QIAGEN, Inc., supra) 
and suitable single restriction enzyme cleavage sites. These elements are arranged such 
that an inserted DNA fragment encoding a polypeptide expresses that polypeptide 
with the six His residues (i.e., a "6 X His tag") covalently linked to the carboxyl 
20 terminus of that polypeptide. 

The DNA sequence encoding the desired portion of a E.faecalis protein of the 
present invention was amplified from E.faecalis genomic DNA using PCR 
oligonucleotide primers which anneal to the 5' and 3' sequences coding for the 
portions of the E.faecalis polynucleotide shown in Table 1 . Additional nucleotides 
25 containing restriction sites to facilitate cloning in the pQE60 vector are added to the 5' 
and 3 1 sequences, respectively. 

For cloning the mature protein, the 5' primer has a sequence containing an 
appropriate restriction site followed by nucleotides of the amino terminal coding 
sequence of the desired E.faecalis polynucleotide sequence in Table 1. One of 
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ordinary skill in the an would appreciate that the point in the protein coding sequence 
where the 5' and ?' primers begin may be varied to amplify a DNA segment encoding 
any desired portion of the complete protein shorter or longer than the mature form. 
The 3' primer has a sequence containing an appropriate restriction site followed bv 
5 nucleotides complementary to the 3" end of the polypeptide coding sequence of Table 
1 , excluding a stop codon. with the coding sequence aligned with the restriction site so 
as to maintain its reading frame with that of the six His codons in the pQE60 vector. 

The amplified E. faccalis DNA fragment and the vector pQE60 were digested 
with restriction enzymes which recognize the sites in the pnmcrs and the digested 

10 DNAs were then ligated together. The E. faccalis DNA was inserted into the 

restricted pQL-60 vector in a manner which places the E.faecalis protein coding region 
downstream from the lPTG-inducible promoter and in-frame with an initiating AUG 
and the six histidme codons. 

The ligation mixture was transformed into competent E. call cells using 

15 standard procedures such as those described by Sambrook et ah, supra.. E. coli strain 
M15/rep4, containing multiple copies of the plasmid pREP4, which expresses the lac 
repressor and confers kanamycin resistance ("Kanr"), was used m carrying out the 
illustrative example described herein. This strain, which was only one of many that 
are suitable for expressing a E. faccalis polypeptide, is available commercially 

20 (Q1AGEN, Inc., supra). Transformants were identified by their ability to grow on LB 
agar plates in the presence of ampicillin and kanamycin. Plasmid DNA was isolated 
from resistant colonies and the identity of the cloned DNA confirmed by restriction 
analysis, PCR and DNA sequencing. 

Clones containing the desired constructs were grown overnight ("O/N") in 

25 liquid culture in LB media supplemented with both ampicillin (100 |ig/ml) and 

kanamycin (25 (ig/ml). The O/N culture was used to inoculate a large culture, at a 
dilution of approximately 1:25 to 1 :250. The cells were grown to an optical density at 
600 nm ("OD600") of between 0.4 and 0.6. Isopropyl-p-D-thiogalactopyranoside 
("IPTG") was then added to a final concentration of 1 mM to induce transcription 
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from the lac repressor sensitive promoter, by inactivating the lad repressor. Cells 
subsequently were incubated further for 3 to 4 hours. Cells then were harvested by 
eentrifugation. 

The cells were then stirred for 3-4 hours at 4°C in 6M guanidine-HCl, pH 8. 
5 The cell debris was removed by eentrifugation, and the supernatant containing the E. 
faecalis polypeptide was loaded onto a nickcl-nitrilo-tri-acctic acid ("Ni-NTA") 
affinity resin column (Q1AGEN, Inc., supra). Proteins with a 6 x His tag bind to the 
Ni-NTA resin with high affinity were purified in a simple one-step procedure (for 
details see: The QIAexpressionist, 1995, Q1AGEN, Inc., supra). Briefly the 
10 supernatant was loaded onto the column in 6 M guanidine-HCl, pH 8, the column was 
first washed with 10 volumes of 6 M guanidine-HCl, pH 8, then washed with 10 
volumes of 6 M guanidine-HCl pH 6, and finally the E. faecalis polypeptide was 
cluted with 6 M guanidine-HCl, pH 5. 

The purified protein was then renatured by dialyzing it against 
1 5 phosphate-buffered saline (PBS) or 50 mM Na-acetate, pH 6 buffer plus 200 rnM 
NaCl. Alternatively, the protein could be successfully refolded while immobilized on 
the Ni-NTA column. The recommended conditions are as follows: renature using a 
linear 6M-1M urea gradient in 500 mM NaCl, 20% glycerol, 20 mM Tris/HCl pH 7.4, 
containing protease inhibitors. The rcnaturation should be performed over a period of 
20 1 .5 hours or more. After renaturation the proteins can be eluted by the addition of 
250 mM immidazole. Immidazole was removed by a final dialyzing step against PBS 
or 50 mM sodium acetate pH 6 buffer plus 200 mM NaCl. The purified protein was 
stored at 4° C or frozen at -80° C. 

Some of the polypeptide of the present invention were prepared using a non- 
25 denaturing protein purification method. For these polypeptides, the cell pellet from 
each liter of culture was resuspended in 25 mis of Lysis Buffer A at 4°C (Lysis Buffer 
A - 50 mM Na-phosphate, 300 mM NaCl, 10 mM 2-mercaptoethanol, 10% 
Glycerol, pH 7.5 with 1 tablet of Complete EDTA-free protease inhibitor cocktail 
(Bochringer Mannheim #1873580) per 50 ml of buffer). Absorbance at 550 nm was 
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approximatcly 10-20 O.D.. nil. The suspension was then put through three 
freeze thaw cycles from -"0 V 'C (using a ethanol-dry ice bath) up to room temperature. 
The cells were lysed via sonication in short 10 sec bursts over 3 minutes at 
approximately SOW while kept on ice. The sonicated sample was then centnfuged at 
5 1 5,000 RPM for 30 minutes at 4 L 'C. The supernatant was passed through a column 
containing 1 .0 nil of CL-4B resin to pre-cieai the sample of any proteins that may 
bind to agarose non-specifically, and the flow-through fraction was collected. 

The prc-cleared flow-through was applied to a nickel-nitrilo-tn-acetic acid 
("Ni-NTA") affinity resin column (Quiagen, Inc., supra). Proteins with a 6 X His tag 

10 bind to the Ni-NTA resin with high affinity and can be purified in a simple one-step 
procedure. Briefly, the supernatant was loaded onto the column in Lysis Buffer A at 
4°C, the column was first washed with 10 volumes of Lysis Buffer A until the A280 
of the eluate returns to the baseline. Then, the column was washed with 5 volumes of 
40 mM Imidazole (92% Lysis Buffer A / 8% Buffer B) (Buffer B - 50 mM Na- 

15 Phosphate, 300 mM NaCl, 10% Glycerol, 10 mM 2-mercaptoethanol, 500 mM 
Imidazole. pH of the final buffer should be 7.5). The protein was eluted off of the 
column with a series of increasing Imidazole solutions made by adjusting the ratios of 
Lysis Buffer A to Buffer B. Three different concentrations were used: 3 volumes of 
75 mM Imidazole, 3 volumes of 150 mM Imidazole, 5 volumes of 500 mM 

20 Imidazole. The fractions containing the purified protein were analyzed using 8 %, 10 
% or 14% SDS-PAGE depending on the protein size. The purified protein was then 
dialyzed 2X against phosphate-buffered saline (PBS) in order to place it into an easily 
workable buffer. The purified protein was stored at 4°C or frozen at -80°. 

The following alternative method may be used to purify E.faecalis expressed 

25 in E coli when it is present in the form of inclusion bodies. Unless otherwise 
specified, all of the following steps are conducted at 4-1 0°C. 

Upon completion of the production phase of the E. coli fermentation, the cell 
culture is cooled to 4-10°C and the cells are harvested by continuous centrifugation at 
15,000 rpm (Heraeus Sepatech). On the basis of the expected yield of protein per 
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unit weight of cell paste and the amount of purified protein required, an appropriate 
amount of cell paste, by weight, is suspended in a buffer solution containing 1 00 mM 
Tns, 50 mM EDTA, pH 7.4. The cells are dispersed to a homogeneous suspension 
using a high shear mixer. 

5 The cells are then lysed by passing the solution through a microfluidizer 

(Microfuidics, Corp. or APV Gaulin, Inc.) twice at 4000-6000 psi. The homogenate 
is then mixed with NaCl solution to a final concentration of 0.5 M NaCl, followed by 
centrifugation at 7000 x g for 1 5 min. The resultant pellet is washed again using 0.5M 
NaCl, 100 mM Tris, 50 mM EDTA, pH 7.4. 

10 The resulting washed inclusion bodies are solubilized with 1 .5 M guanidine 

hydrochloride (GuHCl) for 2-4 hours. After 7000 x g centrifugation for 1 5 min., the 
pellet is discarded and the E.faecalis polypeptide-containing supernatant is incubated 
at 4°C overnight to allow further GuHCl extraction. 

Following high speed centrifugation (30,000 x g) to remove insoluble particles, 

15 the GuHCl solubilized protein is refolded by quickly mixing the GuHCl extract with 
20 volumes of buffer containing 50 mM sodium, pH 4.5, 150 mM NaCl, 2 mM 
EDTA by vigorous stirring. The refolded diluted protein solution is kept at 4°C 
without mixing for 1 2 hours prior to further purification steps. 

To clarify the refolded E. faecalis polypeptide solution, a previously prepared 

20 tangential filtration unit equipped with 0.16 |im membrane filter with appropriate 
surface area (e.g., Filtron), equilibrated with 40 mM sodium acetate, pH 6.0 is 
employed. The filtered sample is loaded onto a cation exchange resin (e.g., Poros HS- 
50, Perseptive Biosystems). The column is washed with 40 mM sodium acetate, pH 
6.0 and eluted with 250 mM, 500 mM, 1000 mM, and 1500 mM NaCl in the same 

25 buffer, in a stepwise manner. The absorbance at 280 mm of the effluent is 

continuously monitored. Fractions are collected and further analyzed by SDS-PAGE, 

Fractions containing the E.faecalis polypeptide are then pooled and mixed 
with 4 volumes of water. The diluted sample is then loaded onto a previously 
prepared set of tandem columns of strong anion (Poros HQ-50, Perseptive 
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Biosvstems") and weak anion (Poros CM-20. Pcrscptive Biosystems) exchange resins. 
The columns are equilibrated with 40 mM sodium acetate, pH 6.0. Both columns are 
washed with 40 mM sodium acetate, pH 6.0, 200 mM NaCl. The CM-20 column is 
then eluted using a 10 column volume linear gradient ranging from 0.2 M NaCl, 50 
5 mM sodium acetate, pH 6.0 to 1.0 M NaCI, 50 mM sodium acetate, pH 6.5. 
Fractions are collected under constant A 2 &o monitoring of the effluent. Fractions 
containing the E. faecalis polypeptide (determined, for instance, by 16% SDS-PAGE) 
are then pooled. 

The resultant E. faecalis polypeptide exhibits greater than 95% purity after 
10 the above refolding and purification steps. No major contaminant bands are observed 
from Commassie blue stained 16% SDS-PAGE gel when 5 [ig of purified protein is 
loaded. The purified protein is also tested for endotoxm/LPS contamination, and 
typically the LPS content is less than 0.1 ng/ml according to LAL assays. 

1 5 Example 2(b): Atlcrnative Expression and Purification Enterococcal polypeptides in E. 
coli 

Tthe vector pQElO was alternatively used to clone and express some of the 
polypeptides of the present invention for use in the soft tissue and systemic infection 
models discussed below. The difference being such that an inserted DNA fragment 
20 encoding a polypeptide expresses that polypeptide with the six His residues (i.e., a "6 
X His tag") covalently linked to the amino terminus of that polypeptide. The bacterial 
expression vector pQElO (Q1AGEN, Inc., 9259 Eton Avenue, Chatsworth, CA, 
91311) was used in this example . The components of the pQElO plasmid are 
arranged such that the inserted DNA sequence encoding a polypeptide of the present 
25 invention expresses the polypeptide with the six His residues (i.e., a "6 X His tag")) 
covalently linked to the amino terminus. 

The DNA sequences encoding the desired portions of a polypeptide of Table 
1 were amplified using PCR oligonucleotide primers from genomic £. faecalis DNA. 
The PCR primers anneal to the nucleotide sequences encoding the desired amino acid 
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sequence of a polypeptide of the present invention. Additional nucleotides containing 
restriction sites to facilitate cloning in the pQElO vector were added to the 5' and 3' 
primer sequences, respectively. 

For cloning a polypeptide of the present invention, the 5' and 3' primers were 
5 selected to amplify their respective nucleotide coding sequences. One of ordinary skill 
in the art would appreciate that the point in the protein coding sequence where the 5 ! 
and 3' primers begins may be varied to amplify a DNA segment encoding any desired 
portion of a polypeptide of the present invention. The 5' primer was designed so the 
coding sequence of the 6 X His tag is aligned with the restriction site so as to maintain 
10 its reading frame with that of E.faecalis polypeptide. The 3' was designed to include 
an stop codon. The amplified DNA fragment was then cloned, and the protein 
expressed, as described above for the pQE60 plasmid. 

The DNA sequences encoding the amino acid sequences of Table 1 may also 
be cloned and expressed as fusion proteins by a protocol similar to that described 
15 directly above, wherein the pET-32b(+) vector (Novagen, 601 Science Drive, 
Madison, Wl 5371 1) is preferentially used in place of pQElO. 

The above methods are not limited to the polypeptide fragements actually 
produced. The above method, like the methods below, can be used to produce either 
full length polypeptides or desired fragements therof. 

20 

Example 2(c): Alternative Expression and Purification of Enterococcal polypeptides 
in E. coli 

The bacterial expression vector pQE60 is used for bacterial expression in this 
example (QIAGEN, Inc., 9259 Eton Avenue, Chatsworth, CA, 91311). However, in 
25 this example, the polypeptide coding sequence is inserted such that translation of the 
six His codons is prevented and, therefore, the polypeptide is produced with no 6 X 
His tag. 

The DNA sequence encoding the desired portion of the E.faecalis amino acid 
sequence is amplified from an E.faecalis genomic DNA prep the deposited DNA 
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clones using PCR oligonucleotide primers which anneal to the 5' and 3 ! nucleotide 
sequences corresponding to the desired portion of the E.faecalis polypeptides. 
Additional nucleotides containing restriction sites to facilitate cloning in the pQE60 
vector are added to the 5' and 3' primer sequences. 

5 For cloning a E.faecalis polypeptides of the present invention, 5' and 3' 

primers are selected to amplify their respective nucleotide coding sequences. One of 
ordinary skill in the art would appreciate that the point in the protein coding sequence 
where the 5' and 3' primers begin may be varied to amplify a DNA segment encoding 
any desired portion of a polypeptide of the present invention. The 3' and 5' primers 

10 contain appropriate restriction sites followed by nucleotides complementary to the 5' 
and 3' ends of the coding sequence respectively. The 3' primer is additionally designed 
to include an in -frame stop codon. 

The amplified E. faecalis DNA fragments and the vector pQE60 are digested 
with restriction enzymes recognizing the sites in the primers and the digested DNAs 

15 are then ligated together. Insertion of the E.faecalis DNA into the restricted pQE60 
vector places the E.faecalis protein coding region including its associated stop codon 
downstream from the lPTG-inducibie promoter and in-frame with an initiating AUG. 
The associated stop codon prevents translation of the six histidine codons 
downstream of the insertion point. 

20 The ligation mixture is transformed into competent E. coli cells using standard 

procedures such as those described by Sambrook et al. E. coli strain M15/rep4, 
containing multiple copies of the plasmid pREP4, which expresses the lac repressor 
and confers kanamycin resistance ("Kanr"), is used in carrying out the illustrative 
example described herein. This strain, which is only one of many that are suitable for 

25 expressing E.faecalis polypeptide, is available commercially (Q1AGEN, Inc., supra). 
Transformants are identified by their ability to grow on LB plates in the presence of 
ampicillin and kanamycin. Plasmid DNA is isolated from resistant colonies and the 
identity of the cloned DNA confirmed by restriction analysis, PCR and DNA 
sequencing. 
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Clones containing the desired constructs are grown overnight ("O/N") in liquid 
culture in LB media supplemented with both ampicillm (100 (lg/ml) and kanarnycin 
(25 |ig/ml). The O/N culture is used to inoculate a large culture, at a dilution of 
approximately 1 :25 to 1:250. The cells are grown to an optical density at 600 nm 
5 ( M OD600 M ) of between 0.4 and 0.6. isopropyl-b-D-thiogalactopyranoside ("IPTG") is 
then added to a final concentration of 1 mM to induce transcription from the lac 
repressor sensitive promoter, by inactivating the lacl repressor. Cells subsequently 
are incubated further for 3 to 4 hours. Cells then are harvested by centrifugation. 

To purify the E.faecalis polypeptide, the cells are then stirred for 3-4 hours at 
10 4°C in 6M guanidine-HCl, pH 8. The cell debris is removed by centrifugation, and the 
supernatant containing the E. faecalis polypeptide is dialyzed against 50 mM Na- 
acetate buffer pH 6, supplemented with 200 mM NaCl. Alternatively, the protein 
can be successfully refolded by dialyzing it against 500 mM NaCl, 20% glycerol, 25 
mM Tris/HCl pH 7.4, containing protease inhibitors. After renaturation the protein 
1 5 can be purified by ion exchange, hydrophobic interaction and size exclusion 

chromatography. Alternatively, an affinity chromatography step such as an antibody 
column can be used to obtain pure E.faecalis polypeptide. The purified protein is 
stored at 4° C or frozen at -80° C. 

The following alternative method may be used to purify E. faecalis 
20 polypeptides expressed in E coli when it is present in the form of inclusion bodies. 
Unless otherwise specified, all of the following steps are conducted at 4-10°C. 

Upon completion of the production phase of the E. coli fermentation, the cell 
culture is cooled to 4-10°C and the cells are harvested by continuous centrifugation at 
1 5,000 rpm (Heraeus Sepatech). On the basis of the expected yield of protein per 
25 unit weight of cell paste and the amount of purified protein required, an appropriate 
amount of cell paste, by weight, is suspended in a buffer solution containing 100 mM 
Tris, 50 mM EDTA, pH 7.4. The cells are dispersed to a homogeneous suspension 
using a high shear mixer. 

The cells ware then lysed by passing the solution through a microfluidizer 
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I Microfuidics, Corp. or APV Gaulin, Inc.) twice at 4000-6000 psi. The homogenate 
is then mixed with NaCl solution to a final concentration of 0.5 M NaCl, followed by 
eentrifugation at ^000 x g for 15 min, The resultant pellet is washed again using 0.5M 
NaCl. 100 mM Tris, 50 mM EDTA, pH 7.4. 
5 The resulting washed inclusion bodies are solubilized with 1 .5 M guanidine 

hydrochloride (GuHCl) for 2-4 hours. After 7000 x g eentrifugation for 15 min., the 
pellet is discarded and the E. faecal is polypeptide-containing supernatant is incubated 
at 4°C overnight to allow further GuHCl extraction. 

Following high speed eentrifugation (30,000 x g) to remove insoluble particles, 
in the GuHCl solubilized protein is refolded by quickly mixing the GuHCl extract with 
20 volumes of buffer containing 50 mM sodium, pH 4.5, 150 mM NaCl, 2 mM 
EDTA by vigorous stirring. The refolded diluted protein solution is kept at 4°C 
without mixing for 12 hours prior to further purification steps. 

To clarify the refolded E. faecalis polypeptide solution, a previously prepared 
15 tangential filtration unit equipped with 0.16 |im membrane filter with appropriate 
surface area (e.g., Filtron), equilibrated with 40 mM sodium acetate, pH 6.0 is 
employed. The filtered sample is loaded onto a cation exchange resin (e.g., Poros HS- 
50, Perseptive Biosystems). The column is washed with 40 mM sodium acetate, pH 
6.0 and eluted with 250 mM, 500 mM, 1000 mM, and 1500 mM NaCl in the same 
20 buffer, in a stepwise manner. The absorbance at 280 mm of the effluent is 

continuously monitored. Fractions are collected and further analyzed by SDS-PAGE. 

Fractions containing the E. faecalis polypeptide are then pooled and mixed 
with 4 volumes of water. The diluted sample is then loaded onto a previously 
prepared set of tandem columns of strong anion (Poros HQ-50, Perseptive 
25 Biosystems) and weak anion (Poros CM-20, Perseptive Biosystems) exchange resins. 
The columns are equilibrated with 40 mM sodium acetate, pH 6.0. Both columns are 
washed with 40 mM sodium acetate, pH 6.0, 200 mM NaCl. The CM-20 column is 
then eluted using a 10 column volume linear gradient ranging from 0.2 M NaCl, 50 
mM sodium acetate, pH 6.0 to 1 .0 M NaCl, 50 mM sodium acetate, pH 6.5. 
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Fractions are collected under constant A 2 go monitoring of the effluent. Fractions 
containing the E. faecalis polypeptide ( determined, for instance, by 16% SDS-PAGE) 
are then pooled. 

The resultant E. faecalis polypeptide exhibits greater than 95% purity after 
5 the above refolding and purification steps. No major contaminant bands are observed 
from Commassie blue stained 1 6% SDS-PAGE gel when 5 |ig of purified protein is 
loaded. The purified protein is also tested for endotoxin/LPS contamination, and 
typically the LPS content is less than 0.1 ng/ml according to LAL assays. 

10 Example 2(d): Cloning and Expression of E. faecalis in Other Bacteria 

E. faecalis polypeptides can also be produced in: E. faecalis using the methods 
of S. Skinner et al., (1988) Mol. Microbiol. 2:289-297 or J. I. Moreno (1996) Protein 
Expr. Purif. 8(3):332-340; Lactobacillus using the methods of C. Rush et al., 1997 
AppL Microbiol. Biotechnol. 47(5):537-542; or in Bacillus subtilis using the methods 

15 Chang et al, U.S. Patent No. 4,952,508. 

Example 3: Cloning and Expression in COS Cells 

A E. faecalis expression plasmid is made by cloning a portion of the DNA 
encoding a E. faecalis polypeptide into the expression vector pDNAI/Amp or 

20 pDNAIII (which can be obtained from lnvitrogen, Inc.). The expression vector 

pDNAl/amp contains: (1) an E. coli origin of replication effective for propagation in 
E. coli and other pr ok ary otic cells; (2) an ampicillin resistance gene for selection of 
plasmid-containing prokaryotic cells; ( 3) an SV40 origin of replication for propagation 
in eukaryotic cells; (4) a CMV promoter, a poly linker, an SV40 intron; (5) several 

25 codons encoding a hemagglutinin fragment (i.e., an n HA" tag to facilitate purification) 
followed by a termination codon and polyadenylation signal arranged so that a DNA 
can be conveniently placed under expression control of the CMV promoter and 
operably linked to the SV40 intron and the polyadenylation signal by means of 
restriction sites in the polylinker. The HA tag corresponds to an epitope derived 
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from the influenza hemagglutinin protein described by Wilson et al. 1984 Cell 37:767. 
The fusion of the HA tag to the target protein allows easy detection and recovery of 
the recombinant protein with an antibody that recognizes the HA epitope. pDNAHI 
contains, in addition, the selectable neomycin marker. 

A DN A fragment encoding a E. faccalis polypeptide is cloned into the 
polylinker region of the vector so that recombinant protein expression is directed by 
the CMV promoter. The plasmid construction strategy is as follows. The DNA from 
a E.faecalis genomic DNA prep is amplified using primers that contain convenient 
restriction sites, much as described above for construction of vectors for expression of 
10 E. faecalis in E. coli. The 5' primer contains a Kozak sequence, an AUG start codon, 
and nucleotides of the 5' coding region of the E. faecalis polypeptide. The 3' primer, 
contains nucleotides complementary to the 3' coding sequence of the E.faecalis DNA, 
a stop codon, and a convenient restriction site. 

The PCR amplified DNA fragment and the vector, pDNAl/Amp, are digested 
15 with appropriate restriction enzymes and then ligated. The ligation mixture is 

transformed into an appropriate E. coli strain such as SURE™ (Stratagene Cloning 
Systems, La J oil a, CA 92037), and the transformed culture is plated on ampicillin 
media plates which then are incubated to allow growth of ampicillin resistant colonies. 
Plasmid DNA is isolated from resistant colonies and examined by restriction analysis 
20 or other means for the presence of the fragment encoding the E. faecalis polypeptide 
For expression of a recombinant E.faecalis polypeptide, COS cells are 
transfected with an expression vector, as described above, using DEAE-dextran, as 
described, for instance, by Sambrook et al. (supra). Cells are incubated under 
conditions for expression of E.faecalis by the vector. 
25 Expression of the E.faecalis-HA fusion protein is detected by radiolabeling 

and immunoprecipitation, using methods described in, for example Harlow et al, 
supra.. To this end, two days after transfection, the cells are labeled by incubation in 
media containing 35 S-cysteme for 8 hours. The cells and the media are collected, and 
the cells are washed and the lysed with detergent-containing RIPA buffer: 150 mM 
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NaCl, 1 % NP-40, 0. 1 % SDS, 1 % NP-40, 0.5% DOC, 50 mM TRIS, pH 7.5, as 
described by Wilson ct al. (supra ). Proteins are precipitated from the cell lysate and 
from the culture media using an HA-specific monoclonal antibody. The precipitated 
proteins then are analyzed by SDS-PAGE and autoradiography. An expression 
5 product of the expected size is seen in the cell lysate, which is not seen in negative 
controls. 

Example 4: Cloning and Expression in CHO Cells 

The vector pC4 is used for the expression of E.faecalis polypeptide in this 

10 example. Plasmid pC4 is a derivative of the plasmid pSV2-dhfr (ATCC Accession 
No. 37146), The plasmid contains the mouse DHFR gene under control of the SV40 
early promoter. Chinese hamster ovary cells or other cells lacking dihydrofolate 
activity that are transfected with these plasmids can be selected by growing the cells 
in a selective medium (alpha minus MEM, Life Technologies) supplemented with the 

15 chemotherapeutic agent methotrexate. The amplification of the DHFR genes in cells 
resistant to methotrexate (MTX) has been well documented. See, e.g., Alt et al., 
1978, J. Biol. Chem. 253:1357-1370; Hamlin et al., 1990, Biochem. et Biophys. Acta, 
1097:107-143; Page etal, 1991, Biotechnology 9:64-68. Cells grown in increasing 
concentrations of MTX develop resistance to the drug by overproducing the target 

20 enzyme, DHFR, as a result of amplification of the DHFR gene. If a second gene is 
linked to the DHFR gene, it is usually co-amplified and over-expressed. It is known 
in the art that this approach may be used to develop cell lines carrying more than 
1,000 copies of the amplified gene(s). Subsequently, when the methotrexate is 
withdrawn, cell lines are obtained which contain the amplified gene integrated into one 

25 or more chromosome(s) of the host cell. 

Plasmid pC4 contains the strong promoter of the long terminal repeat (LTR) 
of the Rouse Sarcoma Virus, for expressing a polypeptide of interest, Cullen, et al. 
(1985) Mol. Cell. Biol. 5:438-447; plus a fragment isolated from the enhancer of the 
immediate early gene of human cytomegalovirus (CMV), Boshart, et al., 1985, Cell 
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41 :52 1-530. Downstream of the promoter are the following single restriction enzyme 
cleavage sites that allow the integration of the genes: Bam HI, Xba 1, <m&Asp 718. 
Behind these cloning sites the plasmid contains the 3' intron and polyadenylation site 
of the rat prepromsuhn gene. Other high efficiency promoters can also be used for the 
5 expression, e.g., the human fi-actin promoter, the SV40 early or late promoters or the 
long terminal repeats from other retroviruses, e.g., HIV and HTLVI. Clontech's Tet- 
Off and Tet-On gene expression systems and similar systems can be used to express 
the E. faecal is polypeptide in a regulated way in mammalian cells (Gossen et al, 1992, 
Proc. Natl. Acad. Sci. USA 89:5547-555 1 . For the polyadenylation of the mRNA 
10 other signals, e.g., from the human growth hormone or globin genes can be used as 
well. Stable cell lines carrying a gene of interest integrated into the chromosomes can 
also be selected upon co-transfection with a selectable marker such as gpt, G418 or 
hygromycin. It is advantageous to use more than one selectable marker in the 
beginning, e.g., G41 8 plus methotrexate. 
15 The plasmid pC4 is digested with the restriction enzymes and then 

dephosphorylated using calf intestinal phosphates by procedures known in the art. 
The vector is then isolated from a 1% agarose gel. The DNA sequence encoding the E. 
faecalis polypeptide is amplified using PCR oligonucleotide primers corresponding to 
the 5 1 and 3' sequences of the desired portion of the gene. A 5' primer containing a 
20 restriction site, a Kozak sequence, an AUG start codon, and nucleotides of the 5' 
coding region of the E. faecalis polypeptide is synthesized and used. A 3' primer, 
containing a restriction site, stop codon, and nucleotides complementary to the 3' 
coding sequence of the E. faecalis polypeptides is synthesized and used. The 
amplified fragment is digested with the restriction endonucleases and then purified 
25 again on a 1 % agarose gel. The isolated fragment and the dephosphorylated vector are 
then hgated with T4 DNA ligase. E. coli HB101 or XL-1 Blue cells are then 
transformed and bacteria are identified that contain the fragment inserted into plasmid 
pC4 using, for instance, restriction enzyme analysis. 

Chinese hamster ovary cells lacking an active DHFR gene are used for 
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transfection. Five |ig of the expression plasmid pC4 is cotransfected with 0.5 \ig of 
the plasmid pSVneo using a lipid-mediated transfection agent such as Lipofectin™ or 
LipofcctAMINE.™ (LifeTechnologies Gaithersburg, MD). The plasmid pSV2-neo 
contains a dominant selectable marker, the neo gene from Tn5 encoding an enzyme 

5 that confers resistance to a group of antibiotics including G41 8. The cells are seeded 
in alpha minus MEM supplemented with 1 mg/ml G418. After 2 days, the cells are 
trypsinized and seeded in hybridoma cloning plates (Greiner, Germany) in alpha 
minus MEM supplemented with 10, 25, or 50 ng/ml of methotrexate plus 1 mg/ml 
G418. After about 10-14 days single clones are trypsinized and then seeded in 6-well 

10 petri dishes or 10 ml flasks using different concentrations of methotrexate (50 nM, 

100 nM, 200 nM, 400 nM, 800 nM). Clones growing at the highest concentrations of 
methotrexate are then transferred to new 6-well plates containing even higher 
concentrations of methotrexate (1 |lM, 2 |iM, 5 |iM, 10 mM, 20 mM). The same 
procedure is repeated until clones are obtained which grow at a concentration of 

15 1 00-200 |iM. Expression of the desired gene product is analyzed, for instance, by 
SDS-PAGE and Western blot or by reversed phase HPLC analysis. 

Example 5: Quantitative Murine Soft Tissue Infection Model for E.faecalis 

Compositions of the present invention, including polypeptides and peptides, 

20 are assayed for their ability to function as vaccines or to enhance/stimulate an immune 
response to a bacterial species (e.g., E.faecalis) using the following quantitative 
murine soft tissue infection model. Mice (e.g., NIH Swiss female mice, approximately 
7 weeks old) arc first treated with a biologically protective effective amount, or 
immune enhancing/stimulating effective amount of a composition of the present 

25 invention using methods known in the art, such as those discussed above. See,e.g., 
Harlow et ah, ANTIBODIES: A LABORATORY MANUAL, (Cold Spring Harbor 
Laboratory Press, 2nd ed. 1988). An example of an appropriate starting dose is 20ug 
per animal. 
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The desired bacterial species used to challenge the mice, such as E.faecalis, is 
grown as an overnight culture. The culture is diluted to a concentration of 5 X 10 8 
cfu'ml, in an appropriate media, mixed well, serially diluted, and titered. The desired 
doses are further diliuted 1:2 with sterilized Cytodex 3 microcarrier beads preswollen 
5 in sterile PBS (3g/100ml). Mice are anesthetize briefly until docile, but still mobile 
and injected with 0.2 ml of the Cytodex 3 bead/bacterial mixture into each animal 
subcutaneously in the inguinal region. After four days, counting the day of injection 
as day one, mice are sacrificed and the contents of the abscess is excised and placed in 
a 15 ml conical tube containing 1.0ml of sterile PBS. The contents of the abscess is 
10 then enzymaticaily treated and plated as follows. 

The abscess is first disrupted by vortexing with sterilized glass beads placed in 
the tubes. 3.0mls of prepared enzyme mixture (1 .0ml Collagenase D (4.0 mg/ml), 
1 .0ml Trypsin (6.0 mg/ml) and 8.0 mis PBS) is then added to each tube followed by a 
20 min. incubation at 37C. The solution is then centrifuged and the supernatant 
15 drawn off. 0.5 ml dH20 is then added and the tubes are vortexed and then incubated 
for 10 min. at room temperature. 0.5 ml media is then added and samples are serially 
diluted and plated onto agar plates, and grown overnight at 37C. Plates with distinct 
and separate colonies are then counted, compared to positive and negative control 
samples, and quantified. The method can be used to identify composition and 
20 determine appropriate and effective doses for humans and other animals by comparing 
the effective doses of compositions of the present invention with compositions 
known in the art to be effective in both mice and humans. Doses for the effective 
treatment of humans and other animals, using compositions of the present invention, 
are extrapolated using the data from the above experiments of mice. It is appreciated 
25 that further studies in humans and other animals may be needed to determine the most 
effective doses using methods of clinical practice known in the art. 



Example 6: Murine Systemic Neutropenic Model for E. faecalis Infection 
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Compositions of the present invention, including polypeptides and peptides, 
are assayed for their ability to function as vaccines or to enhance/stimulate an immune 
response to a bacterial species (e.g., E.faecalis) using the following qualitative murine 
systemic neutropenic model. Mice (e.g., NIH Swiss female mice, approximately 7 

5 weeks old) are first treated with a biologically protective effective amount, or immune 
enhancing/stimulating effective amount of a composition of the present invention 
using methods known in the art, such as those discussed above. See, e.g., Harlow et 
al., ANTIBODIES: A LABORATORY MANUAL, (Cold Spring Harbor Laboratory 
Press, 2nd ed. 1988). An example of an appropriate starting dose is 20ug per animal. 

10 Mice are then injected with 250 - 300 mg/kg cyclophosphamide intraperitonially. 

Counting the day of CP. injection as day one, the mice are left untreated for 5 days to 
begin recovery of PMNL'S. 

The desired bacterial species used to challenge the mice, such as E.faecalis, is 
grown as an overnight culture. The culture is diluted to a concentration of 5 X 10 8 

15 cfu/ml, in an appropriate media, mixed well, serially diluted, and titered. The desired 
doses are further diliuted 1 :2 in 4% Brewer's yeast in media. 
Mice are injected with the bacteria/brewer's yeast challenge intraperitonially. The 
Brewer's yeast solution alone is used as a control. The mice are then monitered twice 
daily for the first week following challenge, and once a day for the next week to 

20 ascertain morbidity and mortality. Mice remaining at the end of the experiment are 
sacrificed. The method can be used to identify compositions and determine 
appropriate and effective doses for humans and other animals by comparing the 
effective doses of compositions of the present invention with compositions known in 
the art to be effective in both mice and humans. Doses for the effective treatment of 

25 humans and other animals, using compositions of the present invention, are 

extrapolated using the data from the above experiments of mice. It is appreciated that 
further studies in humans and other animals may be needed to determine the most 
effective doses using methods of clinical practice known in the art. 
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The disclosure of all publications (including patents, patent applications, 
journal articles, laboratory manuals, books, or other documents) cited herein are 
hereby incorporated by reference in their entireties. 

The present invention is not to be limited in scope by the specific 
5 embodiments described herein, which are intended as single illustrations of individual 
aspects of the invention. Functionally equivalent methods and components arc within 
the scope of the invention, in addition to those shown and described herein and will 
become apparant to those skilled m the art from the foregoing description and 
accompanying drawings. Such modifications are intended to fall within the scope of 
10 the appended claims. 
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TABLE 1. Nucleotide and Amino Acid Seqeuences of E. faecalis Genes. 



EF001-1 (SEQ ID NO:I) 



TGAAAGAATA TTGCCAGAAC GTGGCGAGCA 
AAAAAAATGA AGTTCAAAAC TCTAGCAACA 
TTGGGGGCTT GTGGTAACGG TAATGGGGCC 
AAGGAAGATA CGACAATCAC TTTCTGGCAT 
ACAAAATTAA CGAAAGACTT CATGAAAGAA 
CAATCTGCTT ACCCTGATTT ACAAGCCAAA 
TTACCAACAA TTACGCAAGC GTACCCAGGC 
TTAGTGGACT T AAAA C CAT A TATGGATGAT 
ATTCGTGAAG TATTGTTAGA CGGCGCCAAA 
AATAAATCGA CAGAAaTGTT ATTCTATAAT 
GTACCGAAAA CATTAGAGGA ATTAAAAGAA 
AAAGAAGTCG TTGGTGCTGG TTTTGACTCG 
AACAAAGGCG TTGATTTTAA TAAAGACTTA 
GTGGACTATT ACCGTGATGG TATCGAAGCA 
TATTTATCTG GCCCATTTGC AAACAAAAAG 
GCTGGTTTTG TTCAAAAAGA TGCTGAAGCT 
CCTGAAAAAA TCAACTTACA ACAAGGAACA 
GAACAACGGA CAGCGGCATT TGAATTCATG 
TACTGGGCAC AACAAACAGG TTATATGCCA 
TACAAAAATT CTAAGACAAC CAAAGTACCT 
TTCGCTATCC CAGTAGAAGA AAATGCTGAT 
GAAAGTATTT TTGCTTCATC AAATAAAGAC 
CAATTTGAAC AAGCATGGAA CCAATAA 



AATTGTTTTA TAAATTTTTT TAAGGGAGAG 
ACAGTGTTAG CAACCGCAGC TATTTTCGCA 
AAAGAATCAA ACGATATTGT GAAAGAAGTG 
GCAATGAATG GGGTTCAAGA AGAAGCGTTA 
AATCCAAAAA TTAAAGTGGA ATTACAAAAT 
ATCAATTCGA CTTTAACTTC ACCAAAAGAT 
TGGTTATGGA ATGCTGCACA AGATGAAATG 
GACACAATCG GCTGGAAAGA TGCAGAGCCA 
ATCGACGGCA AACAATACGG CATTCCATTT 
GCTGATTTGT TGAAAGAATA TGGTGTTGAA 
GCTTCTAAAA CAATTTACGA AAAATCCAAC 
TTAAATAACT ATTACGCAAT TGGAATGAAA 
GATTTAACAA GCAAAGATTC ACAAGAAGTC 
GGTTACTTCC GCACAGCTGG TTCAGATAAA 
GTAGCAATGT TTGTCGGTAG TATTGCTGGT 
GGTGGCTATG AATACGGTGT TGCACCACGT 
GATATTTATA TGTTCGATAG TGCTACGCCA 
AAATTCTTAG CTACTCCTGA TTCACAATTG 
ATTTTAGAAT CTGTTTTACA CAGTGATGAG 
GCACAACTTG AAAACGCAGT AAAAGATTTA 
TCAGCCTATA ATGAAATGCG GACAATTATG 
ACGAGAAAAT TATTGAAAGA TGCAACATCA 



EF001-2 <SEQ ID NO : 2 ) 



MKFKTLATT VLATAAIFAL GACGNGNGAK ! 
EDTTITFWHA MNGVQEEALT KLTKDFMKEN 
PTITQAYPGW LWNAAQDEML VDLKPYMDDD 
KSTEMLFYNA DLLKEYGVEV PKTLEELKEA 
KGVDFNKDLD LTSKD3QEW DYYRDGIEAG 
GFVQKDAEAG GYEYGVAPRP EKINLQQGTD 
WAQQTGYMPI LESVLHSDEY KNSKTTKVPA 
SIFASSNKDT RKLLKDATSQ FEQAWNQ 



PKIKVELQNQ SAYPDLQAKI NSTLTSPKDL 

TIGWKDAEPI REVLLDGAKI DGKQYGIPFN 

SKTIYEKSNK EWGAGFDSL NNYYAIGMKN 

YFRTAGSDKY LSGPFANKKV AMFVGSIAGA 

IYMFDSATPE QRTAAFEFMK FLATPDSQLY 

QLENAVKDLF AIPVEENADS AYNEMRTIME 



EF001-3 (SEQ ID NO : 3 ) 

TT GTGGTAACGG TAATGGGGCC AAAGAATCAA ACGATATTGT GAAAGAAGTG 
AAGGAAGATA CGACAATCAC TTTCTGGCAT GCAATGAATG GGGTTCAAGA AGAAGCGTTA 
ACAAAATTAA CGAAAGACTT CATGAAAGAA AATCCAAAAA TTAAAGTGGA ATTACAAAAT 
CAATCTGCTT ACCCTGATTT ACAAGCCAAA ATCAATTCGA CTTTAACTTC ACCAAAAGAT 
TTACCAACAA TTACGCAAGC GTACCCAGGC TGGTTATGGA ATGCTGCACA AGATGAAATG 
TTAGTGGACT TAAAACCATA TATGGATGAT GACACAATCG GCTGGAAAGA TGCAGAGCCA 
ATTCGTGAAG TATTGTTAGA CGGCGCCAAA ATCGACGGCA AACAATACGG CATTCCATTT 
AATAAATCGA CAGAAATGTT ATTCTATAAT GCTGATTTGT TGAAAGAATA TGGTGTTGAA 
GTACCGAAAA CATTAGAGGA ATTAAAAGAA GCTTCTAAAA CAATTTACGA AAAATCCAAC 
AAAGAAGTCG TTGGTGCTGG TTTTGACTCG TTAAATAACT ATTACGCAAT TGGAATGAAA 
AACAAAGGCG TTGATTTTAA TAAAGACTTA GATTTAACAA GCAAAGATTC ACAAGAAGTC 
GTGGACTATT ACCGTGATGG TATCGAAGCA GGTTACTTCC GCACAGCTGG TTCAGATAAA 
TATTTATCTG GCCCATTTGC AAACAAAAAG GTAGCAATGT TTGTCGGTAG TATTGCTGGT 
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GCZGGTTZ^G TTCAAAAAGA T3GTGAAGCT GGTG3CTATG AATAGGGTGT TGCACCACGT 
CCTj .AAAAAA TC AACTTACA A I AAGGAACA GA7A7TTATA 7GT7CGATAG TGCTACGCGA 
GAATAACGGA CAGOGGCAT7 7GAATTCATG AAAT7C7TAG CTACTGCTGA TTCACAATTG 
TAC7GGGGAC AATAAACAGG TTATATGCCA ATTTTAGAAT GTGTTTTACA GAGTGATGAG 
7ACAAAAAT7 CTAAGACAAC CAAAGTACCT GGACAACTTG AAAAGGCAGT AAAAGATTTA 
TTGGGTATCC CAGTAGAAGA AAAT 3 CTG AT 7CAGGG7A7A ATGAAATG GG GACAATTATG 
GAAAGTATTT 77C0T7GATC AAA T AAA GAG AGGAGAAAA7 TATTGAAAGA TGCAAGATCA 
CAATTTGAAC AAGCATGGAA CCAA 



EF0 01-4 (SEQ ID NO : 4 ) 

CGNGNGAK ESNLIVKEVK 

ECTTITFWHA MNGVQEEALT KLTKDFKKEN 

PTITQAYPGW LWNAAQDEML Vf-LKPYMDDD 

KSTEMLFYNA DLLKEYGVEV PFTLEELKEA 

KGVCFNKDLD LTSKDSQEW UYYRDGIEAG 

GFVQKDAEAG GYEYGVAPRP EKINLQQGTD 

WAQQTGYMPI LESVLHSDEY KNSKTTKVPA 

SIFASSNKDT RKLLKDATSQ FEQAWNQ 



PKIKYELQNQ SAYPDLQAKI NSTLTSPKDL 

TIGWKDAEPI REVLLDGAKI DGKQYGI PFN 

SKTIYEKSNK EWGAGFDSL NNYYAIGMKN 

YFRTAGSDKY LSGPFANKKV AMFVGSIAGA 

IYMFDSATPE QRTAAFEFMK FLATPDSQLY 

QLENAVKDLF AIPVEENADS AYNEMRTIME 



EF0C2-1 (SEQ ID NO : 5 ) 

TAAATAGCGG AGGTAGTACA AATGAAATTT 
TTAGCAGTGG CGGCAGTAAC TTTAACAGCA 
GAAAAGAGTG AAGATGGCAA AACAAAATTA 
CCAGAATTTG AGAAATTATT CAGAGCTTTT 
CCGGTGGACA TTGCTTCAGA TGATTATGAC 
GATACGACGG ATATTTTAAC CATGAAAAAC 
AATCAATTGG TGGATTTAAC CGATCACGTT 
AGTTACGAGA TGTATGAAAT CGATGGTAAA 
TGGGTATTGT ATTACAATAA AAAAATGTTT 
TTAACTTGGG ATGAATATGA AGCGTTAGCG 
TATGGTGCCT ATCAACATAC TTGGCGCTCA 
AATGCCAATT TGATTGAACC AAAATACAAT 
AGAATGCAAA AAGATCAATC ACAAATGGAT 
TATCAATCAC AATTTGAAAA TTCAAAAGCG 
GGGACTTTAT TAAGAAACAT TGATGATGGC 
ATACCACAAC AAGAAAAAGG CAAAGCAACT 
AATAAAAACA GTAAAAAACA AAAAGCTGCT 
GAAGGTGCAA AACTTTTAGC AGAAGTAGGG 
GATAAAATCT ACTTTGCAAG AAAAGGAATG 
ACCCAGATAC AATTAATTTA G 



TGGAAAAAAG GCTTAACAGC GGCAGCGCTG 
TGTGGTGGTT CAAGTGAAAA GAAAGCAACT 
ACAGTAACTA CTTGGAATTA TGACACGACC 
GAAGCGGAAA ATCCTGATAT CACTATTGAA 
ACAAAAGTAA CAACGATGCT TTCATGAGGA 
TTACTTTCAT ATTCTAATTA CGCGCTACGC 
AAAGATTTAG ATATCGAACC TGCCAAAGCA 
ACCTATGCTC AGCCTTACCG TACAGATTTC 
GATGAAGCCG GAATTGCCTA TCCCGATAAC 
AAAAAATTAT CTAAACCAGA AGAACAAGTA 
ACCGTTCAAG CGATTGCTGC TGCTCAAAAC 
TATATGGAAA CTTATTATGA TCGCGCATTG 
TTTGGAACAG CAAAATCAAC AAAAGTAACG 
GCGATGATGT ACATGGGTAG CTGGTACATG 
AAAACAAATG TCGAATGGGG GATTGCCGAA 
ACCTTTGGCT CACCGACAAG TTTTGCAATT 
CAAAAATTCT TAGACTTTGC TTCAGGTAAA 
GTGGTTCCTT CTTATAAAAC AGATGAAATT 
CCTTCAGACG AGTCTCACAA AAAGCCTTTA 



EF002-2 (SEQ ID NO : 6 ) 

MKFW KKGLTAAALL AVAAVTLTAC GGSSEKKATE KSEDGKTKLT VTTWNYDTTP 
EFEKLFRAFE AENPDITIEP VDIASDDYDT KVTTMLSSGD TTDILTMKNL LSYSNYALRN 
QLVDLTDHVK DLDIEPAKAS YEMYEIDGKT YAQPYRTDFW VLYYNKKMFD EAGIAYPDNL 
TWDEYEALAK KLSKPEEQVY GAYQHTWRST VQAIAAAQNN ANLIEPKYNY METYYDRALR 
MQKDQSQMDF GTAKSTKVTY QSQFENSKAA MKYMGSWYMG TLLTNIDDGK TNVEWGIAEI 
PQQEKGKATT FGSPTSFAIN KNSKKQKAAQ KFLDFASGKE GAKLLAEVGV VPSYKTDEID 
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KIYFARKGMP SDESHKKPLT QIQLI 

EF002-3 (SEQ ID NO : 7 ) 

A TGTGGTGGTT CAAGTGAAAA GAAAGCAACT 

GAAAAGAGTG AAGATGGCAA AACAAAATTA ACAGTAACTA C TTGGAATT A 
CCAGAATTTG AGAAATTATT CAGAGCTTTT GAAGCGGAAA ATCCTGATAT 
CCGGTGGACA TTGCTTCAGA TGATTATGAC ACAAAAGTAA CAACGATGCT 
GATACGACGG ATATTTTAAC CATGAAAAAC TTACTTTCAT ATTCTAATTA 
AATCAATTGG TGGATTTAAC CGATCACGTT AAAGATTTAG ATATCGAACC 
AGTTACGAGA TGTATGAAAT CGATGGTAAA ACCTATGCTC AGCCTTACCG 
TGGGTATTGT ATTACAATAA AAAAATGTTT GATGAAGCCG GAATTGCCTA 
TTAACTTGGG ATGAATATGA AGCGTTAGCG AAAAAATTAT CTAAACCAGA 
TATGGTGCCT ATCAACATAC TTGGCGCTCA ACCGTTCAAG CGATTGCTGC 
AATGCCAATT TGATTGAACC AAAATACAAT TATATGGAAA GTTATTATGA 
AGAATGCAAA AAGATCAATC ACAAATGGAT TTTGGAACAG CAAAATCAAC 
TATCAATCAC AATTTGAAAA TTCAAAAGCG GCGATGATGT ACATGGGTAG 
GGGACTTTAT TAACAAACAT TGATGATGGC AAAACAAATG TCGAATGGGG 
ATACCACAAC AAGAAAAAGG CAAAGCAACT ACCTTTGGCT CACCGACAAG 
AATAAAAACA GTAAAAAACA AAAAGCTGCT CAAAAATTCT TAGACTTTGC 
GAAGGTGCAA AACTTTTAGC AGAAGTAGGG GTGGTTCCTT CTTATAAAAC 
GATAAAATCT ACTTTGCAAG AAAAGGAATG CCTTCAGACG AGTCTCACAA 
ACCCAGATAC AATTAATT 

EF002-4 (SEQ ID NO : 8 ) 

C GGSSEKKATE KSEDGKTKLT VTTWNYDTTP 

EFEKLFRAFE AENPDITIEP VDIASDDYDT KVTTMLSSGD TTDILTMKNL 
QLVDLTDHVK DLDIEPAKAS YEMYEIDGKT YAQPYRTDFW VLYYNKKMFD 
TWDEYEALAK KLSKPEEQVY GAYQHTWRST VQAIAAAQNN ANLIEPKYNY 
MQKDQSQMDF GTAKSTKVTY QSQFENSKAA MMYMGSWYMG TLLTNIDDGK 
PQQEKGKATT FGSPTSFAIN KNSKKQKAAQ KFLDFASGKE GAKLLAEVGV 
KIYFARKGMP SDESHKKPLT QIQLI 



EF003-1 (SEQ ID NO : 9 ) 

TAGGAGGACA AAAGAATGAA GAAGTTTTAT TTAGCNACAT TCGCTGTTAT TGCAACAGTT 

ATTTTAGCTG CCTGTGGGGG AAATAAACAA GCAGACCAGA AAGAAGACAA GGAGATTACC 

GTTGCCGTGC AATTGGAATC TTCAAAAGAT ATCTTGGAGA TTGCCAAGAA AGAAGCTGAG 

AAAAAAGGGT ACAAAATTAA CATTATGGAA GTGAGCGACA ATGTTGCCTA CAACGATGCC 

GTGGAACATG ACGAAGCGGA TGCTAATTTT GCGCAACATC AACCCTTCAT GGAAATGTTT 

AACAAAGAGA AAAAAGCTGA TTTAGTGGCT GTGCAACCGA TTTATTATTT TGCTGGTGGT 

TTCTATTCAA AAGAATACCA AGATGCGAAA GATTTACCTG AAAATGCCAA AGTGGGGATT 

CCTAGCGATC CAACCAATGA AGGTCGTGCT TTAGCAATTT TAAATGCAAA CGGCGTGATT 

AAATTAAAAG AAGGTGTCGG CTTTAACGGC ACGGTGGCAG ATGTCGTGGA AAATCCTAAA 

AACATCAGTT TTGAAAGCAT TGATTTACTG AATTTAGCTA AAGCCTATGA TGAAAAAGAC 

ATCGCTATGG TGTTCTGCTA CCCAGCCTAC TTAGAACCTG CTGGTTTAAC AACGAAAGAT 

GCGATCTTGT TAGAAGATAA AGAAGCAAGT AAACATTACG CATTGCAAGT TGTGACACGC 

AAAGGCGAAA AAGATAGCGA AAAAATCAAG GTTTTAAAAG AAGCGATGAC AACAAAAGAA 
GTTGCTGAAT ACATCAAGAA AAATTCTAAA GGCGCCAATA TTCCTGCGTT TTAA 

EF003-2 {SEQ ID NO:10) 



TGACACGACC 
CACTATTGAA 
TTCATCAGGA 
CGCGCTACGC 
TGCCAAAGCA 
TACAGATTTC 
TCCCGATAAC 
AGAACAAGTA 
TGCTCAAAAC 
TCGCGCATTG 
AAAAG TAACG 
CTGGTACATG 
GATTGCCGAA 
TTTTGCAATT 
TTCAGGTAAA 
AGATGAAATT 
AAAGCCTTTA 



LSYSNYALRN 
EAGIAYPDNL 
METYYDRALR 
TNVEWGIAEI 
VPSYKTDEID 
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MKKFYL ATFAVIATYI LAACGGNKCA DCKEFKEITV AVQLESSKCI LEIAKKFAEK 
KGYKINIMEY SEIJVAYNFAY vHFEAEANEA CHCFFKEMFK KEKKADLVAV QPIYYFAGGF 
YSKEYQFAXE EPENAKYGIF SEPTNEGRAL AILNANGVIK LKEGVGFNG7 VAFWEaPKN 
ITFESIDLLN LAKAYDEKCI AXYFCYPAYL EPAGETTKDA ILLEDKEASK HYALQWTRK 
G E K D S E K I K V L K EAMTTK E V AE Y I KK N S KG AN I ? A F 

EFCC3-3 f SEC ID NO: II) 

CZGZ'GGGSG AAATAAACAA GCAGACCAGA AAGAAGACAA GGAGATTACC 
GTTGCCGTGC AATTGGAATC TTCAAAAGAT ATCTTGGAGA TTGCCAAGAA AGAAGCTGAG 
AAAAAAGGGT ACAAAATTAA CAT7ATGGAA GTGAGCGACA ATGTTGCCTA CAACGATGCC 
GT jCAACATG ACGAAGCGGA tgctaatttt GCGCAACATC AACCCTTCAT GGAAATGTTT 
AACAAAGAGA AAAAAGCTGA TTTAGTGGCT GTGCAACCGA TTTATTATTT TGCTGGTGGT 
TTCTATTCAA AAGAATACCA AGATGCGAAA GATTTACCTG AAAATGCCAA AGTGGGGATT 
CCTAGGGATC CAACCAATGA AGGTCGTGCT TTAGCAATTT TAAATGCAAA CGGCGTGATT 
AAATTAAAAG AAGGTGTCGG CTTTAACGGC ACGGTGGCAG ATGTCGTGGA AAATCCTAAA 
AACATCACTT TTGAAAGCAT TGA7TTACTG AATTTAGGTA AAGCCTATGA T 3 AAAAAGAC 
ATCGCTATGG TGTTCTGCTA CCCAGCCTAC TTAGAACCTG CTGGTTTAAC AACGAAAGAT 
GCGATCTTGT TAGAAGATAA AGAAGCAAGT AAACATTACG CATTGCAAGT TGTGACACGC 
AAAGGCGAAA AAGATAGCGA AAAAATCAAG GTTTTAAAAG AAGCGATGAC AACAAAAGAA 
GTTGCTGAAT ACATCAAGAA AAATTCTAAA GGCGCCAATA TTCCTGCGTT T 



EF003-4 (SEQ ID N0:12) 

CGGNKQA DCKEDKEITV AVCLESSKDI LEIAKKEAEK 

KGYKINIMEV SDNVAYNDAV QHDEADANFA QKQPFMEKFN KEKKADLVAV QPIYYFAGGF 
YSKEYQDAKD LPENAKVGIP SDPTNEGRAL AILNANGVIK LKEGVGFNGT VADWENPKN 
ITFES IDLLN LAKAYDEKDI AMVFCYPAYL EPAGLTTKDA ILLEDKEASK HYALQWTRK 
GEKDSEKIKV LKEAMTTKEV AEYIKKNSKG ANIFAF 



EF004-1 (SEC ID N0:13) 

TAAATCGAAA GAAGGATGAT AGAAATGAAA AAAATGATTA AATTTGCAGG CATTGCTCTT 
ATTTTTGCAG CTCTTCTCTC TGCCTGTAGC AACGCAAAAA ATAATACACA AAAGAAAGCC 
GAAACTGCTG CCCAGTCAAG CACTATTGAA GCTTCAGACA GTAACGAAAA CGAGCCTAAT 
ACAGAAAACA TAACCCAAGC AGTTAAACAG TTAGAAGAAA AATTTAAC TC TGACGAGAAA 
TTAGTAAAAA TAGATGTTAA AAATAATGTT AAAGATGACA CATCAGATAA CCCTCACGCT 
GTCATTACGG TTAAGGTAAT TAATGATGAA GCAAAAAAAA ATATGGAAGA AATGCAGACT 
GCGATAGATT CCAACTCAGG TACAGAGGCA CAAAAGACTG CCATATACGG AATTCAATTA 
AATGTTGAAG AAGTAGCCAA AACATTAGAA AATGATAACG ATGTTATTTC TTTCATCACA 
CCTTACACGA ATGGGAACGA CAGAACCATA GCAAAATCAA CTAAAAATGA AAATATTATT 
CCGTTAGTAA AATAA 

EF004-2 (SEC ID N0:14) 

mkk mikfagiali faallsacsn AKNNTQKKAE taaqsstiea sdsnenepnt 

ENITQAVKQL EEKFNSDEKL VKIDVKNNVK DDTSDNPHAV ITVKVINDEA KKNMEEMQTA 
IDSNSGTEAC KTAIYGICLN VEEVAKTLEN DNDVISFITP YTNGNDRTIA KSTKNENI IP 
LVK 



EF004-3 (SEC ID NC:15) 
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CTGTAGC AACGCAAAAA AT, 
GAAACTGCTG CCCAGTCAAG 
ACAGAAAACA TAACCCAAGC 
TTAGTAAAAA TAGATGTTAA 
GTCATTACGG TTAAGGTAAT 
GCGATAGATT CCAACTCAGG 
AATGTTGAAG AAGTAGCCAA 
CCTTACACGA ATGGGAACGA 
CCGTTAGTAA AA 



lATACACA AAAGAAAGCC 
CACTATTGAA GCTTCAGACA 
AGTTAAACAG TTAGAAGAAA 
AAATAATGTT AAAGATGACA 
T AA TG A TG AA GCAAAAAAAA 
TACAGAGGCA CAAAAGACTG 
AACATTAGAA AATGATAACG 
CAGAACCATA GCAAAATCAA 



GTAACGAAAA CGAGCCTAAT 

AATTTAACTC TGACGAGAAA 

CATCAGATAA CCCTCACGCT 

ATATGGAAGA AATGCAGACT 

CCATATACGG AATTCAATTA 

ATGTTATTTC TTTCATCACA 

CTAAAAATGA AAATATTATT 



EF004-4 (SEQ ID NO: 16) 

CSN AKNNTQKKAE TAAQSSTIEA SDSNENEPNT 

ENITQAVKQL EEKFNSDEKL VKIDVKNNVK DDTSDNPHAV ITVKVINDEA KKNMEEMQTA 
IDSNSGTEAQ KTAIYGIQLN VEEVAKTLEN DNDVISFITP YTNGNDRTIA KSTKNENIIP 
LVK 



EF005-1 ( SEQ ID NO: 17 ) 

TAAAAAATGA AAAAACGATT GACGATTGTG GGGATGCTTT TTCTGGCCAT TTTAGTAATG 
GTTGGTTGTG GTAAAAATCA GCAAGCAACG ACAAAAGAAA AAGAGACAAA ACCTGAAGAA 
CTAACTCTTT ACATTGTGCG CCACGGAAAA ACCATGTTAA ATACGACGGA CCGCGTACAA 
GGATGGTCAG ATGCGGTCCT AACACCAGAA GGTGAAAAAG TTGTGACAGC AACTGGGATT 
GGACTGAAAG ATGTTGCCTT TCAAAATGCA TATAGTAGTG ATAGTGGCCG CGCCTTGCAA 
ACTGCTCAAC TTATTTTAGA TCAAAATAAA GCAGGCAAAG ACCTTGAAGT CGTGCGTGAC 
CCAGATTTAC GTGAATTTAA TTTTGGTAGC TATGAAGGGG ATTTAAATAA GACAATGTGG 
CAGGATATTG CTGATGATCA AGGTGTTTCC TTAGAAGAAT TTATGAAAAA CATGACTCCT 
GAATCCTTTG CCAATAGTGT AGCTAAACTG GATCAACAGC GCG AGGAAAG CAAGAATAAC 
TGGCCTGCAG AAGACTATGC TACAATTACT AAACGTTTGA AAAAAGGCTT AGATAAAATT 
GTTGC CACAG AATCAGCCAA TTCTGGGAAT GGCAATGTTT TAGTGGTCTC TCATGGCTTG 
AGTATTTCAG CGTTGTTAGC AACTTTATTT GATGATTTTA AAGTCCCAGA AGGCGGTTTG 
AAGAATGCTA GTGTCACAAC AATTCATTAC AAAAATGGCG AATATACTTT GGATAAAGTC 
AATGATGTCA GCTACTTAGA AGCAGGCGAA AAAGAATCAA AATAA 

EF005-2 (SEQ ID NO-.18) 

MKKRLTIVG MLFLAILVMV GCGKNQQATT KEKETKPEEL TLYIVRHGKT MLNTTDRVQG 
WSDAVLTPEG EKWTATGIG LKDVAFQNAY SSDSGRALQT AQLILDQNKA GKDLEWRDP 
DLREFNFGSY EGDLNKTMWQ DIADDQGVSL EEFMKNMTPE SFANSVAKLD QQREESKNNW 
PAEDYATITK RLKKGLDKIV ATESANSGNG NVLWSHGLS ISALLATLFD DFKVPEGGLK 
NASVTTIHYK NGEYTLDKVN DVSYLEAGEK ESK 

EF005-3 { SEQ ID NO:19) 

TTGTG GTAAAAATCA GCAAGCAACG ACAAAAGAAA AAGAGACAAA ACCTGAAGAA 
CTAACTCTTT ACATTGTGCG CCACGGAAAA ACCATGTTAA ATACGACGGA CCGCGTACAA 
GGATGGTCAG ATGCGGTCCT AACACCAGAA GGTGAAAAAG TTGTGACAGC AACTGGGATT 
GGACTGAAAG ATGTTGCCTT TCAAAATGCA TATAGTAGTG ATAGTGGCCG CGCCTTGCAA 
ACTGCTCAAC TTATTTTAGA TCAAAATAAA GCAGGCAAAG ACCTTGAAGT CGTGCGTGAC 
CCAGATTTAC GTGAATTTAA TTTTGGTAGC TATGAAGGGG ATTTAAATAA GACAATGTGG 
CAGGATATTG CTGATGATCA AGGTGTTTCC TTAGAAGAAT TTATGAAAAA CATGACTCCT 
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GAATCCTTTG CCAATAGTGT 

TGGC CTGCA Z AAGACTATGC 

GTTGCCA^Aj aatcagccaa 

AGTATTTCAG CGT7GTTAGC 

AAGAATGC7A GTG7CACAAC 

AATGATGTCA GCTACTTAGA 

EF005-4 ( SEQ ID NC:20) 



AGCTAAAGTG GATCAACAGC 
TACAATTACT AAA2GT77GA 
TTC7GGGAAT GGCAATG7TT 
AAC77TATT7 GA7GAT777A 
AAT7CATTAC AAAAATG G C G 
AGCAGGCGAA AAAGAATCAA 



GCGAGGAAAG CAAGAA7AAC 
AAAAAGGCT7 AGATAAAATT 
7AG7GGTCTC 7CATGGCTTG 
AAGTCCCAGA AGGGGGTTTG 
AA7ATACT77 GGATAAAGTC 
AA 



CGKNQQAT7 KEKETKPEEL TLYIVRHGXT MLNT7DRVQG 

WS3AVLTPEG EKWTATGIG LXDVAFQICAY SSDSGRALQT AQLILDQNKA GKDLEWRDP 
DLREFNFGSY ECOLNKTMWQ DIADDQGVSL EEFMKJCMTPE SFANSVAKLD QQREESKNNW 
PAEDYATITK RLKKGLDK IV ATESANSGNG NVLWSHGLS ISALLATLFD DFKVPEGGLK 
KASVTTIHYK NGEYTLDKVN DVSYLEAGEK ESK 



EF006- 1 (SEQ ID NO : 2 I ) 

TAAACGATAA ATGGAGGGAA TAAGA7GAAA AAACGTACAT TATGGTCAGT AATTACTGTA 

GCAGTAGCTG TCTTAGTTTT AGGGGCTTGC GGCAATAAAA AGAGTGATGA CTCGGTCTTG 

AAAGTTGGAG GTTCACCAGT TGCACATGCA GAGATTTTAG AACATGTAAA ACCTTTATTA 

GAAAAAGAAG GCGTAAAATT AGAAGTGACG ACTTATACAG ATTACGTGCT ACCTAACAAG 

GCGTTGGAAA GTGGCGATAT CGATGCCAAC TATTTCCAAC ATGTGCCGTT GTTTAATGAA 

GCGGTTAAAG AAAATGATTA TGAGTTTGTG AATGCAGGTG CGATTCATTT AGAACCAGTT 

GGGCTTTACT CGAAAAAATA CAAATCGTTA CAAGAAATTC CTGATGGTTC AACGATTTAC 

GTTAGCTCTT CCGTTTCAGA TTGGCCACGC GTATTAACTA TCTTAGAAGA TGCTGGTTTA 

ATCACGCTGA AAGAAGGGGT AGACCGGACA ACTGCTACTT TCGATGATAT TGATAAAAAT 

ACTAAAAAGT TGAAATTCAA TCATGAAAGT GATCCAGCAA TCATGACCAC TCTTTATGAC 

AATGAAGAAG GGGCTGCGGT TTTAATTAAC TGAAACTTTG CCGTGGATCA AGGATTAAAT 

CCClx 21 A ^ 1 £ n T\ r r*r^nr^-h r ^ r nr^r^ r^mm * * * * * * mm ^ > ^ „ ,-™-r, » ™ ~ , „, _ _ 

~ ^^^^^^ xo^, v-x ino/wwA o^jj-Lrto j. j. ^./^^ 1 JL J rti\jU^>VH. I 'ATXATTGCG 

GTTCGTAAAG AAGACGAAAA CAACGAAAAT GTAAAAAAAT TAGTCAAAGT GTTACGTAGC 
AAAGAAGTCC AAGATTGGAT TACGAAAAAA TGGAACGGCG CTATTGTTCC AGTCAATGAA 
TAA 

EF006-2 ( SEQ ID NO:22) 

MKK RTLWSVITVA VAVLVLGACG NKKSDDSVLK VGASPVPHAE ILEHVKPLLE 
KEGVKLEVTT YTDYVLPNKA LESGDIDANY FQHVPFFNEA VKENDYDFVN AGAIHLEPVG 
LYSKKYKSLQ EIPDGSTIYV SSSVSDWPRV LTILEDAGLI TLKEGVDRTT ATFDDIDKNT 
KKLKFNHESD PAIMTTLYDN EEGAAVLINS NFAVDQGLNP KKDAIALEKE SSPYANIIAV 
RKEDENNENV KKLVKVLRSK EVQDWITKKW NGAIVPVNE 

EF006-3 (SEQ ID NO:23) 

TTGC GGCAATAAAA AGAGTGATGA CTCGGTCTTG 

AAAGTTGGAG CTTCACCAGT TCCACATGCA GAGATTTTAG AACATGTAAA ACCTTTATTA 
GAAAAAGAAG GCGTAAAATT AGAAGTGACG ACTTATACAG ATTACGTGCT ACCTAACAAG 
GCGTTGGAAA GTGGCGATAT CGATGCCAAC TATTTCCAAC ATGTGCCGTT CTTTAATGAA 
GCGGTTAAAG AAAATGATTA TGACTTTGTG AATGCAGGTG CGATTCATTT AGAACCAGTT 
GGGCTTTACT CGAAAAAATA CAAATCGTTA CAAGAAATTC CTGATGGTTC AACGATTTAC 
GTTAGCTCTT CCGTTTCAGA TTGGCCACGC GTATTAACTA TCTTAGAAGA TGCTGGTTTA 
ATCACGCTGA AAGAAGGGGT AGACCGGACA ACTGCTACTT TCGATGATAT TGATAAAAAT 
ACTAAAAAGT TGAAATTCAA TCATGAAAGT GATCCAGCAA TCATGACCAC TCTTTATGAC 
AATGAAGAAG GGGCTGCGGT TTTAATTAAC TCAAACTTTG CCGTGGATCA AGGATTAAAT 
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CCGAAAAAAG ATGCGATTGC CTTAGAAAAA GAAAGTTCAC CTTATGCCAA TATTATTGCG 
GTTCGTAAAG AAGACGAAAA CAACGAAAAT GTAAAAAAAT TAGTCAAAGT GTTACGTAGC 
AAAGAAGTCC AAGATTGGAT TACGAAAAAA TGGAACGGCG CTATTGTTCC AGTCAATGAA 



EF006-4 ( SEQ ID NO:24) 

CG NKKSDDSVLK VGASPVPHAE ILEHVKPLLE 

KEGVKLEVTT YTDYVLPNKA LESGDIDANY FQHVPFFNEA VKENDYDFVN AGAIHLEPVG 
LYSKKYKSLQ EIPDGSTIYV SSSVSDWPRV LTILEDAGLI TLKEGVDRTT ATFDD I DKNT 
KKLKFNHESD PAIMTTLYDN EEGAAVLINS NFAVDQGLNP KKDAIALEKE SSPYANIIAV 
RKEDENNENV KKLVKVLRSK EVQDWITKKW NGAIVPVNE 

EF008-1 ( SEQ ID NO:25) 

TAAACCGTGA GAAAGAAATG GAGGAATCAA CGAATGAAAA AATTTAGTTT ATTTTTTTTA 
ACACTTTTAG CAGGGTTAAC GTTAGCTGCT TGCGGGAATC AAGCCGCTGA AAAGAAAGAA 
AAATTAGCAA TTGTGACAAC GAACTCGATC CTATCTGATT TAG TG AAAAA TGTTGGGCAA 
GACAAAATTG AGCTGCATAG TATTGTGCCA ATTGGGACAG ACCCTCACGA ATATGAACCG 
TTACCAGAAG ACATTGCGAA AGCTTCTGAA GCGGACATTT TATTCTTTAA CGGCTTGAAC 
TTAGAAACAG GCGGAAATGG CTGGTTTAAC AAATTAATGA AAACGGCCAA AAAAGTTGAG 
AATAAAGATT ACTTTTCTAC AAGCAAAAAT GTTACGCCAC AATATTTAAC AAGTGCCGGT 
CAAGAACAAA CAGAAGATCC ACATGCTTGG TTAGACATTG AAAATGGCAT TAAATATGTA 
GAAAACATTC GTGACGTGTT A G TAG AAAAA GATCCAAAAA ATAAAGATTT CTATACAGAA 
AACGCGAAAA ATTATACCGA AAAACTTAGC AAACTACATG AGGAAGCCAA AGCTAAATTT 
GCTGATATTC CTGATGATAA AAAATTATTA GTTACAAGTG AAGGTGCCTT TAAATATTTC 
TCCAAAGCTT ATGATTTAAA TGCCGCTTAT ATTTGGGAAA TTAACACAGA AAGTCAAGGN 
ACACCTGAAC AAATGACCAC GATTATTGAT ACCATTAAGA AATCAAAAGC ACCTGTGTTA 
TTTGTTGAAA CCAGTGTCGA TAAACGTAGT ATGGAACGGG TCTCAAAAGA AGTGAAACGA 
CCAATTTACG ATACACTTTT CACAGACTCT CTTGCCAAAG AAGGAACAGA AGGCGATACG 
TACTACAGCA TGATGAACTG GAATTTAACA AAAATCCATG ATGGCTTAAT GAGTAAATAA 



EF008-2 {SEQ ID NO:26) 

MKKFSLFFLT LLAGLTLAAC GNQAAEKKEK LAIVTTNSIL SDLVKNVGQD 
KIELHSIVPI GTDPHEYEPL PEDIAKASEA DILFFNGLNL ETGGNGWFNK LMKTAKKVEN 
KDYFSTSKNV TPQYLTSAGQ EQTEDPHAWL DIENGIKYVE NIRDVLVEKD PKNKDFYTEN 
AKNYTEKLSK LHEEAKAKFA DIPDDKKLLV TSEGAFKYFS KAYDLNAAYI WEINTESQGT 
PEQMTTIIDT IKKSKAPVLF VETSVDKRSM ERVSKEVKRP IYDTLFTDSL AKEGTEGDTY 
YSMMNWNLTK IHDGLMSK 

EF008-3 (SEQ ID NO:27) 

T TGCGGGAATC AAGCCGCTGA AAAGAAAGAA 

AAATTAGCAA TTGTGACAAC GAACTCGATC CTATCTGATT T AG TG AAAAA TGTTGGGCAA 
GACAAAATTG AGCTGCATAG TATTGTGCCA ATTGGGACAG ACCCTCACGA ATATGAACCG 
TTACCAGAAG ACATTGCGAA AGCTTCTGAA GCGGACATTT TATTCTTTAA CGGCTTGAAC 
TTAGAAACAG GCGGAAATGG CTGGTTTAAC AAATTAATGA AAACGGCCAA AAAAGTTGAG 
AATAAAGATT ACTTTTCTAC AAGCAAAAAT GTTACGCCAC AATATTTAAC AAGTGCCGGT 
CAAGAACAAA CAGAAGATCC ACATGCTTGG TTAGACATTG AAAATGGCAT TAAATATGTA 
GAAAACATTC GTGACGTGTT AGTAGAAAAA GATCCAAAAA ATAAAGATTT CTATACAGAA 
AACGCGAAAA ATTATACCGA AAAACTTAGC AAACTACATG AGGAAGCCAA AGCTAAATTT 
GCTGATATTC CTGATGATAA AAAATTATTA GTTACAAGTG AAGGTGCCTT TAAATATTTC 
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T C C AAA G C - - >f-i T 2 A ^ TTA,*ir. 

A ; _.-\ C C TGAAL rtAA _ GAl 

TTTGTTGAAA CCAGTGT2GA 

^ t\ ^ _ rVv_ -K^jU Tort 1 vJrtA'_ ^ \j 



GATTrtTTGAT ACCATTAAG.M. 
TAAfnCG . AG ^ ATGChACGG^ 
CACAGACTCT CTTGCCAA^G 

GnA.T-nr.^rt rtAAAT^ AT ; j 



77AACACAGA AAGTCAAGGK 
AATCAAAAGC ACCTGTGTTA 
TCTCAAAAGA AG TG AAA CG A 
AAGGAACAGA AGGCGATACG 
ATGGCTTAAT GAGTAAA 



efocs-4 (seq id hc :26> 

c gnqaaekkek laiyttnsil sdlvk*cvgqd 

kielhsivp: gtepheyefl pediaz-asea didffnslnl etggngwfnk lmktakkven 
kdyfstskny tpqyltsagq eqteefhawl d i eng i kyve nirdvlvekd pxnkdfyten 
akmyteklsk lheeakakfa dipddkkldv tsegafkyfs kaydlxaayi weintesqgt 
peqmtti iet ikkskapvlf vetsvdkrsm ervskevkrp i ydtlftesl akegtegdty 
y s mmnwnltp t 2hdglmsk 



EF009-1 ( SEQ ID NO: 29) 

TGACAAATGA AAAAATTTAG TAAATTAATT GGACTTATTG GGGTATTAGC TTTTACGATT 
GCAGGTTGTG CATCGGGGTG TGTGAAGGAT ACTAAGACAG AAACCGTTAA ACTAGGGGTT 
GTAGGAACAA AAAATGATGA ATGGGAATCG GTCAAAGACC GTTTGAAAAA GAAAAATATT 
GATTTACAAT TGGTAGAATT TACAGACTAT ACGCAACCAA ACGCAGCATT AGCAGAAAAA 
GAAATTGATT TAAATGCCTT TCAGCATCAA ATCTTTTTAG ACAATTACAA TAAAGAGCAT 
GGAACGAAAT TAGTATCAAT TGGCAATACA GTCAATGCAC CATTGGGAAT TTACGCTAAT 
AAATTGAAAG ATATCACGAA AATTAAAGAC GGCGGAGAAA TTGCTATTCC TAATGACCCA 
ACGAATGGCG GGCGGGCGTT AATTTTATTA CAAACTGCAG GACTGATAAA AGTAGATCCT 
GCGAAACAGC AACTACCGAC TGTCAGTGAT ATTACTGAAA ATAAAGGGCA ATTGAAAATA 
ACTGAATTAG ATGCTACGCA AACAGCGCGC GCTTTACAAG ATGTCGATGC TTCAGTGATT 
AATAGCGGCA TGGCTGTCGA TGCTGGGTAT ACACCAGATA AAGATGCTAT TTTCTTAGAA 
CCTGTAAACG AAAAAGCGAA ACCTTATGTG AACATTGTCG TGGCCCGAGA AGAAGATCAA 
GAGAATAAAC TTTATCAAAA AGTTGTAGAA GAATATCAAC AAGAAGAAAC GAAAAAGGTC 
ATTGCAGAAA CATCAAAAGG CGCCAATGTT CCAGCCTGGG AAACATTTGG TAAAAAATAA 

EFC09-2 (SEQ ID NO : 3 C ) 

MEKFSKLIG LIGVLAFTIA GCASGSVKDT KTETVKLGW GTKNDEWESV KDRLKKKNID 
LQLVEFTDYT QPNAALAEKE IDLNAFQHQI FLDNYNKEHG TKLVSIGNTV NAP LG I YANK 
LKDITKIKDG GEIAIPNDPT NGGRALILLQ TAGLIKVDPA KQQLPTVSDI TENKRQLKIT 
ELDATQTARA L£DVDASVIN SGMAVDAGYT PDKDAIFLEP VNEKAKPYVN IWAREEDQE 
NKLYQKWEE YQ ; QEETKKVI AETSKGANVP AWETFGKK 

EF009-3 (SEQ ID NO : 3 1 ) 

TTGTG CATCGGGGTC TGTGAAGGAT ACTAAGACAG AAACCGTTAA ACTAGGGGTT 
GTAGGAACAA AAAATGATGA ATGGGAATCG GTCAAAGACC GTTTGAAAAA GAAAAATATT 
GATTTACAAT TGGTAGAATT TACAGACTAT ACGCAACCAA ACGCAGCATT AGCAGAAAAA 
GAAATTGATT TAAATGCCTT TCAGCATCAA ATCTTTTTAG ACAATTACAA TAAAGAGCAT 
GGAACGAAAT TAGTATCAAT TGGCAATACA GTCAATGCAC CATTGGGAAT TTACGCTAAT 
AAATTGAAAG ATATCACGAA AATTAAAGAC GGCGGAGAAA TTGCTATTCC TAATGACCCA 
ACGAATGGCG GGCGGGCGTT AATTTTATTA CAAACTGCAG GACTGATAAA AGTAGATCCT 
GCGAAACAGC AACTACCGAC TGTCAGTGAT ATTACTGAAA ATAAACGCCA ATTGAAAATA 
ACTGAATTAG ATGCTACGCA AACAGCGCGC GCTTTACAAG ATGTCGATGC TTCAGTGATT 
AATAGCGGCA TGGCTGTCGA TGCTGGGTAT ACACCAGATA AAGATGCTAT TTTCTTAGAA 
CCTGTAAACG AAAAAGCGAA ACCTTATGTG AACATTGTCG TGGCCCGAGA AGAAGATCAA 
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GAGAATAAAC TTTATCAAAA AGTTGTAGAA GAATATCAAC AAGAAGAAAC GAAAAAGGTC 
ATTGCAGAAA CATCAAAAGG CGCCAATGTT CCAGCCTGGG AAACATTTGG TAAAAAA 



EF009-4 (SEQ ID NO:32) 

CASGSVKDT KTETVKLGW GTKNDEWESV KDRLKKKNID 

LQLVEFTDYT QPNAALAEKE IDLNAFQHQI FLDNYNKEHG TKLVSIGNTV NAPLGIYANK 
LKDITKIKDG GEIAI PNDPT NGGRALILLQ TAGLIKVDPA KQQLPTVSDI TENKRQLKIT 
ELDATQTARA LQDVDASVIN SGMAVDAGYT PDKDAIFLEP VNEKAKPYVN IWAREEDQE 
NKLYQKWEE YQQEETKKVI AETSKGANVP AWETFGKK 



EF010-1 (SEQ ID NO:33) 



TGAAAGAATA AAATTGTACA GGAGGAAATA 
CGTAGCTGGC GCACAGCTTT AGCTGTAGGA 
GCAGCGTCAA AAGGTGATGA TAGTACACCA 
CCAGATAATT ATGACCAATT AATCGATAAT 
GCAAAATTAA AAATGGAATT TGTTGGTTGG 
GTTGCTTCTG GTGAAAGCTA TGATATTTCA 
AAAGGCGCCT ATGCTGATTT AACTGATTTA 
CAATTGCCAG ATAACTATAT TAAAGGAAAT 
ATTTTAGGTA ACTCTTACGG TCAACAAGTT 
TACAATTTAG ATATTAGTAA AGTCGATGGT 
GAATTCCNTA AAAANGANCC AAATATTGCT 
ACAGGTAATT ATGACTTCCC TATTGGTAAC 
ACTGGCTCAC CAAAAATTAT TAACCAATAT 
GTCTTGCATC AATGGTATAA AGATGGCTTG 
CCATATGACT TAAATACCAA TACTTGGTTT 
TATGGTGATA CAATCTTAAC ACAAGCTGCT 
GAACCATTAA AAACAACAGC TCAAGCGCAA 
AAAAACAAAG AAAAATCTGT TGAATTGTTA 
AACGGACTTG TTTATGGTGA AGAAGGCAAA 
AAATTGTTGA AAGATTACAC ACCAACAACT 
TTAATCATTT GGCCAGAAGA ATCTGTCACT 
ATCGAAGAAG CAAAAGATTC ACCAATTCTT 
ACTGAAATCA CTAACGTTGC TACAGTTATG 
ACTGTTGATC CAGAAGAAAC ACTTCCAAAA 
GATAAAGTTC AAAAAGAAAT GCAAACACAA 



AGGAATGAAA AAATGGCAAA AAGGATTAGC 
CTAAGCGCGT GCGGTAAATC TTC AAAAG AT 
ACGTTATTAA TGTATCGTGT TGGGGACAAA 
GCGAATAAAA TTATCGAGAA AAAAATTGGG 
GGCGATTGGG ACCAAAAAAT GTCAACAATC 
TTAGCACAAA ATTATGCAAC GAATGCACAA 
GCACCTAAAT ATGCCAAAGA AGCCTATGAT 
ACGATTAATG GAAAACTGTA TGCGTTCCCA 
TTAACTTTTA ATAAAGAATA TGTC GAT AAA 
AGTTATGAAA GTGCAACGGA AGTTCTAAAA 
GCTTTTGCTA TCGGCCAAAC ATTCTTTGCA 
CAATATCCAT TTGCAGTAAA AACAACTGAT 
GCCGACAAAG ACATGATTAA TAACTTAAAA 
ATTCCAACAG ATGCTGCTAC AAGTACAACA 
ATGCGTCAAG AAACACAAGG ACCTATGGAT 
GGCAAACCAC TTGTTTCTCG TCCACTAACA 
ATGGCTAACT ATGTTGTTGC AAACACGTCT 
GGTTTATTAA ACAGCAATCC AGAATTGTTA 
CAATATGAAA AAGTTGGCGA TGATCGTGTG 
CATTTGAGTG CTTGGAACAC AGGAAACAAC 
GAAGAAATGG TTAAAGAACG TGATAAGAGC 
GGTTTTACTT TTGTAAATGA TAAAGTGAAA 
AACCGTTACG CAGCAAGCTT AAATACAGGA 
TTAATGGATG ACCTAAAAAC AGCTGGCTGG 
TTAGACGAAT ATATCCAATC TCAAAAATAA 



EF010-2 (SEQ ID NO:34) 



MAKRISR SWRTALAVGL SACGKSSKDA ASKGDDSTPT LLMYRVGDKP 

DNYDQLIDNA NKI IEKKIGA KLKMEFVGWG DWDQKMSTIV ASGESYDISL AQNYATNAQK 

GAYADLTDLA PKYAKEAYDQ LPDNYIKGNT INGKLYAFPI LGNSYGQQVL TFNKEYVDKY 

NLDISKVDGS YESATEVLKE FXKXXPNIAA FAIGQTFFAT GNYDFPIGNQ YPFAVKTTDT 

GSPKI INQYA DKDMINNLKV LHQWYKDGLI PTDAATSTTP YDLNTNTWFM RQETQGPMDY 

GDTILTQAAG KPLVSRPLTE PLKTTAQAQM ANYWANTSK NKEKSVELLG LLNSNPELLN 

GLVYGEEGKQ YEKVGDDRVK LLKDYTPTTH LSAWNTGNNL IIWPEESVTE EMVKERDKS I 

EEAKDSPILG FTFVNDKVKT EITNVATVMN RYAASLNTGT VDPEETLPKL MDDLKTAGWD 
KVQKEMQTQL DEYIQSQK 



EF010-3 (SEQ ID NO : 3 5 ) 
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GT GC3GTAAATC TTCAAAAGAT 

GCA3TGTCAA AAGG7GATGA TAGTACACCA ArGTTATTAA TGTATCGTGT TGGGGACAAA 

rCASATAATT ATGAC CAAT7 AATCGATAAT GGGAA7AAAA TTATCGAGAA AAAAATTGGG 

GG AAAA7TAA AAA T G G AA 7 7 7GTTGG77GG GG:GA77GGG ACCAAAAAA7 GTCAACAATC 

G77GCT7CTG GTGAAAGC7A TGATA7TTCA 7TAGCACAAA ATTATGCAAC GAATGCACAA 

AAAGGGGCCT A7G G7GA777 AACT3A777A G TAG G7AAA7 ATGCGAAAGA AGCGTATGAT 

:A>\TTG^CA3 ataactatat taaaggaaat aggattaatg GAAAACTGTA TGCGTTCCCA 

ATTTTAGGTA ACTCT7ACGG TCAACAAGTT TTAACTTTTA ATAAAGAATA TG T C GAT AAA 

7ACAAT7TA3 ATATTAGTAA AGTCGATGGT A G TTATG AAA GTGGAACGGA AGTTCTAAAA 

GAATTC I NT A AAAANGANCC AAATA77GCT GC7TTTGCTA TCGGCCAAAC ATTC7TTGCA 

ACAGGTAATT ATGACTTCCC TATTGGTAAC CAATATCCAT TTGCAGTAAA AACAACTGAT 

A C TGGCTC AC CAAAAATTAT TAACCAATAT GCCGACAAAG ACATGATTAA TAACTTAAAA 

GTCTTGCATC AATGGTATAA AGATGGCTTG ATTGCAACAG ATGCTGCTAC AAGTACAACA 

rCATATGACT TAAATACCAA 7ACT7GGTTT ATGCGTCAAG AAACACAAGG ACCTATGGAT 

TATGGTGATA CAATCTTAAC A GAAGCTGCT GGCAAACCAG TTGTTTCTCG TCCACTAACA 

GAACCATTAA AAACAACAGC TGAAGCGCAA ATGGCTAACT ATGTTGTTGC AAACACGTCT 

AAAAACAAAG AAAAATCTGT TGAATTGTTA GGTTTATTAA ACAGCAATCC AGAATTGTTA 

AACGGACTTG TTTATGGTGA AGAAGGCAAA CAATATGAAA AAGTTGGCGA TGATGGTGTG 

AAATTGTTGA AAGATTACAC ACCAACAACT CATTTGAGTG CTTGGAACAC AGGAAACAAC 

7TAATCA7T7 GGCCAGAAGA ATCTGTCAC7 GAAGAAATGG TTAAAGAACG TGATAAGAGC 

ATCGAAGAAG C AAAAG ATTC ACCAATTCTT GGTTTTACTT TTGTAAATGA TAAAGTGAAA 

ACTGAAATCA CTAACGTTGC TACAGTTATG AACCGTTACG CAGCAAGCTT AAATAC AGG A 

ACTGTTGATC CAGAAGAAAC ACTTCCAAAA TTAA7GGATG ACCTAAAAAC AGC TGGCTGG 

GATAAAGTTC AAAAAGAAAT GCAAACACAA TTAGACGAA7 ATATCCAATC TCAAAAA 



EF010-4 (SEQ ID NO : 3 6 ) 

CGKSSKDA A5KGDDSTPT LLMYRVGDKP 
DNYDQLIDNA NKI IEKKIGA KLKKEFVGWG 
GAYADLTDLA PKYAKEAYDQ LPDNYIKGNT 
NLDISKVDGS YESATEVLKE FXKXXPNIAA 
GSPKI INQYA DKDMINNLKV LHQWYKDGLI 
GDTILTQAAG KPLVSRPLTE PLKTTAQAQM 
GLVYGEEGKQ YEKVGDDRVK LLKDYTPTTH 
EEAKDSPILG FTFVNDKVKT E I TNVATVMN 
KVQKEMQTQL DEYIQSQK 



DWDQKMSTIV ASGESYDISL AQNYATNAQK 

INGKLYAFPI LGNSYGQQVL TFNKEYVDKY 

FAIGQTFFAT GNYDFPIGNQ YPFAVKTTDT 

PTDAATSTTP YDLNTNTWFM RQETQGPMDY 

ANYWANTSK NKEKSVELLG LLNSNPELLN 

LSAWNTGNNL IIWPEESVTE EMVKERDKSI 

RYAASLNTGT VDPEETLPKL MDDLKTAGWD 



EF011-1 ( SEQ ID NO:37) 

TAACGTTTTT GGAGGAAAAG AATGAAAAAG 
ATGGGACTGT TAATGTTAAG TGCTTGTCAA 
ACAGAAACAA CAGCTAAAAC GGAAGTCACA 
CCCAAAAATC CTAAGAAAGT CGTTGTTTTT 
CTAGGTGTCG GTGACCGCGT GGTAGGTGCG 
AAATAC C AAA AAGTTGAATC AGCAGGCGGC 
CAACTAAAAC CAGACTTAAT TATTATTTCT 
AAAGCCATTG CGCCAACCAT TTACTTAGCT 
AAACAAAATA TCGAAACGTT AGGCACTATT 
ATAACTGGCT TAGAAAAAGA AATTGCTGAC 
AATGCGCTTG TTGTGTTAGT TAACGAAGGA 
TTCGGTTTAA TTCATGATAC ATTTGGCTTC 



AAATTTTTAG CAATGATGGC AGTTTCAATG 
ACAAATAAAA AAACAGCAGA TTCTGCAACA 
GTCAAAGACA CCAATGGTCA ATTAACCGTT 
GATAATGGTT CCTTGGATAC AATGGATGCA 
CCAACTAAAA ATATCCCTGC GTATTTGAAA 
ATTAAAGAAC CAGATTTAGA AAAAATCAAT 
GGTCGTCAAC AAGATTATCA AGAACAATTA 
GTAGATGCCA AAAATCCTTG GGCATCAACG 
TTTGATAAAG AAGAGGTAGC TAAAGAAAAA 
GTGAAAAAAC AAGCAGAAGC TAGCGCGAAT 
CAACTTTCCG CTTACGGAAA AGGCTCTCGT 
AAAGCAGCAG ACGATAAGAT TGAAGCTTCC 
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ACTCATGGGC AAAGTGTTTC TTACGAATAT GTTTTAGAAA AAAATCCTGG GATTCTCTTT 

GTGGTAGATC GCACCAAAGC AATTGGTGGC GACGATTCAA AAGATAACGT CGCTGCAAAC 

GAATTGATTC AAAAAACCGA TGCTGGTAAA AATGATAAAG TCATTATGCT TCAACCAGAT 

GTTTGGTATC TAAGCGGTGG TGGATTAGAA TCAATGCATT TGATGATAGA AGATGTTAAA 
AAAGGATTAG AGTAA 

EF011-2 { SEQ ID NO : 3 8 ) 

MKKK FLAMMAVSMM GLLMLSACQT NKKTADSATT ETTAKTEVTV KDTNGQLTVP 

KNPKKVWFD NGSLDTMDAL GVGDRWGAP TKNIPAYLKK YQKVESAGGI KEPDLEKINQ 

LKPDLIIISG RQQDYQEQLK AIAPTIYLAV DAKNPWASTK QNIETLGTIF DKEEVAKEKI 

TGLEKEIADV KKQAEASANN ALWLVNEGQ LSAYGKGSRF GLIHDTFGFK AADDKIEAST 

HGQSVSYEYV LEKNPGILFV VDRTKAIGGD DSKDNVAANE LIQKTDAGKN DKVIMLQPDV 
WYLSGGGLES MHLMIEDVKK GLE 



EF011-3 { SEQ ID NO : 3 9 ) 



TTGTCAA ACAAATAAAA AAACAGCAGA TTCTGCAACA 

ACAGAAACAA CAGCTAAAAC GGAAGTCACA GTCAAAGACA CCAATGGTCA ATTAACCGTT 
CCCAAAAATC CTAAGAAAGT CGTTGTTTTT GATAATGGTT CCTTGGATAC AATGGATGCA 
CTAGGTGTCG GTGACCGCGT GGTAGGTGCG CCAACTAAAA ATATCCCTGC GTATTTGAAA 
AAATACCAAA AAGTTGAATC AGCAGGCGGC ATTAAAGAAC CAGATTTAGA AAAAATCAAT 
CAACTAAAAC CAGACTTAAT TATTATTTCT GGTCGTCAAC AAGATTATCA AGAACAATTA 
AAAGCCATTG CGCCAACCAT TTACTTAGCT GTAGATGCCA AAAATCCTTG GGCATCAACG 
AAACAAAATA TCGAAACGTT AGGCACTATT TTTGATAAAG AAGAGGTAGC TAAAGAAAAA 
ATAACTGGCT TAGAAAAAGA AATTGCTGAC GTGAAAAAAC AAGCAGAAGC TAGCGCGAAT 
AATGCGCTTG TTGTGTTAGT TAACGAAGGA CAACTTTCCG CTTACGGAAA AGGCTCTCGT 
TTCGGTTTAA TTCATGATAC ATTTGGCTTC AAAGCAGCAG ACGATAAGAT TGAAGCTTCC 
ACTCATGGGC AAAGTGTTTC TTACGAATAT GTTTTAGAAA AAAATCCTGG GATTCTCTTT 
GTGGTAGATC GCACCAAAGC AATTGGTGGC GACGATTCAA AAGATAACGT CGCTGCAAAC 
GAATTGATTC AAAAAACCGA TGCTGGTAAA AATGATAAAG TCATTATGCT TCAACCAGAT 
GTTTGGTATC TAAGCGGTGG TGGATTAGAA TCAATGCATT TGATGATAGA AGATGTTAAA 
AAAGGATTAG AG 



EF011-4 (SEQ ID NO : 4 0 ) 

CQT NKKTADSATT ETTAKTEVTV KDTNGQLTVP 

KNPKKVWFD NGSLDTMDAL GVGDRWGAP TKNIPAYLKK YQKVESAGGI KEPDLEKINQ 
LKPDLIIISG RQQDYQEQLK AIAPTIYLAV DAKNPWASTK QNIETLGTIF DKEEVAKEKI 
TGLEKEIADV KKQAEASANN ALWLVNEGQ LSAYGKGSRF GLIHDTFGFK AADDKIEAST 
HGQSVSYEYV LEKNPGILFV VDRTKAIGGD DSKDNVAANE LIQKTDAGKN DKVIMLQPDV 
WYLSGGGLES MHLMIEDVKK GLE 



EF012-1 (SEQ ID NO:41) 

TGAGGGGGCA ACAACATGAA ATTGGGGAAA AAAGTAGTAG GTTTGATTGC AACAGGGTTT 
CTTTTAGCCG CATGTGGCGG AACCAAAGAA GCGGCAGAGA AAGTAGATTC GGGAAATTTA 
GCAGCTGAAC AAAAAATCAG TATTAGTTCA CCTGCACCAA TCTCAACATT GGATACAACA 
CAAACAACAG ATAAAAATAC CTTTACAATG GCACAACATT TATTTGAAGG CCTTTATCGG 
TTTGATGATG ATAGTGCCAC GGTGCCAGCT CTAGCTAAAG ATGTCAAGAT TAGTGACGAT 
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■ _ G — C C C AAG 

AATGCC7ATT 
GTCGATGAAT 

g:::aacctt 

TTT'GTC 3AA.G 

GAATACTATG 
GATAATACTG 
CAATATGTTC 
TACTTCT7AG 
GCGATTGGCC 
AAACCCCTTA 
TTCCGAGCTT 
ACGAAAGCCC 
GATCAAGGAA 
TTAGAAATTA 
AATTATGAGT 
AACTTATATG 
CAATTGGTAG 
TACAAAGAAG 
AGTGCCTCAA 
GATTATTTCC 



A C C A C TTT A C 
ATTTTGTTTA 
TAGTAGACAG 
TAGGGAT ^ TC 
CCTTCTTAGC 
CGCAA.GGC AA 

ATGCGGATCA 
GGATTAACTT 
AA2AATAT:A 
ATTTCAACAA 
AAGCAATTGA 
ACGGATTGAT 
ACAGTGGCGA 
AAG TGGATGT 
AA CG AATTGC 
CCATTTCATC 
TGTCTCTTTC 
CAGGAGAATC 
AAGAGGCACG 
CGGAAGACAT 
ATTATCTAAT 
ACTTGCGCAA 




AGTGAAACTG 
A7ATCAAGTG 
AGATGATCCA 
AAAAG AA' 3 G A 
TAAAGAAGCC 
TCCAAGTAAA 
ATA TTT G AAA 
C G G T AAAAAA 
TGAATATGTT 
GCAACCAAGT 
AGGATGGATT 
AAGTTACAAT 
AACGATTAAT 
CTTGTTGAAC 
CAATCCTAAA 
TGCCTATTTA 



GGGATTAAGT 
AAACTGGTGA 
A G T T T T G A. AA 
GACAAAGAAT 
A7TGC77G3T 
- TGvjATAGTG 
TCA I! ATACTT 
GAAGAAGTTG 
AATGAACTAG 
GoCTATGTCA 
ACGCCATTAG 
r rTAACACAAA 
C TTTATGCGA 
AATGACGTCA 
GTGAAACTTT 
CAAAGTCAGT 
AATAATGTGA 
GCCGGCAGTA 
TACGGCAATT 
GCCAATAATC 
CAAGATGCTG 
TTGAAAGGCA 
ACAGAATGA 



GGAGCAACGG 
CACCAGCGAC 
TACGGAACGG 
TCATTGTTGA 
TAGCGCCACA 
AACATTTACT 
GGACATTGAA 
CGGTTAGCAC 
ACTTAGTTCG 
GTCATCCAGA 
CGAATGTTCA 
GTGTCTTAAA 
ATCCAGAAAC 
AAAAAGCTCA 
CATTGCTGGC 
TGCAAGAAAA 
ACCAATCGCG 
GTGAATTAGA 
ATCATAATGC 
CAGAGAAACA 
CCCAAGTACC 
TTAGTTATCA 



CGAGCCAATC 
GA.TTGGACCG 
TGAAAAGTCA 
ATTAAAACAG 
AAATCAAAAA 
TTATAGCGGG 
AJLAAAA7CCA 
AATCAAAGAA 
CATTAACGGA 
TGTGGCCAAC 
TTTACGAAAA 
CGATGGGTCA 
GGATGAAGAT 

ag:tgaatgg 
ggcagacaca 
tctgccaggt 

ACGTGAAAAA 
CTCTTACTTT 
CAAATACGAC 
GTTTGCAGAA 
GCTGTATCAA 
CTTGTATGGG 



EF012-2 (SEQ ID NO:42) 



MKLGKK WGLIATGFL LAACGGTKEA AEKVDSGNLA AEQKISISSP APISTLDTTQ 
TTCKNTFTKA QHLFEGLYRF CDDSATVPAL AKDVKISDDG RKYHFTLREG IKWSNGEPIT 
AQDFVYSWKK LVTPATIGPN AYLLDSVKNS FEIRNGEKSV DELGISAPND KEFIVELKQA 
QPSFLAWSI AWLAPQNQKF VEAQGKDYAL DSEHLLYSGP FTLANWDATS DTWTLKKNPE 
YYDADQVKLE EVAVSTIKED NTGINLYQVN ELDLVRINGQ YVQQYQDDPG YVSHPDVANY 
FLDFNKKEGT PLANVHLRKA IGQAIDKEAL TQSVLNDGSK PLNGLIPSKL YANPETDEDF 
RAYSGEYLKN DVKKAQAEWT KAQADVGKKV KLSLLAADTD QGKRIAEYVQ SQLQENLPGL 
EITISSQPSN NVNQSRREKN YELSLSGWI A GSSELDSYFN LYAGESSYNY GNYHNAKYDQ 
LVEEARTINA NNPEKQFAEY KEAEDILLNQ DAAQVPLYQS ASNYLINPKL KGISYHLYGD 
YFHLRNAYLT E 



EF012-3 ( SEQ ID NO:43) 

ATGTGGCGG AACCAAAGAA GCGGCAGAGA AAGTAGATTC GGGAAATTTA 

GCAGCTGAAC AAAAAATCAG TATTAGTTCA CCTGCACCAA TCTCAACATT GGATACAACA 

CAAACAACAG ATAAAAATAC CTTTACAATG GCACAACATT TATTTGAAGG CCTTTATCGG 

TTTGATGATG ATAGTGCCAC GGTGCCAGCT CTAGCTAAAG ATGTCAAGAT TAGTGACGAT 

GGGCGCAAGT ACCACTTTAC CTTGCGGGAG GGGATTAAGT GGAGCAACGG CGAGCCAATC 

ACGGCCCAAG ATTTTGTTTA TTCTTGGAAA AAACTGGTGA CACCAGCGAC GATTGGACCG 

AATGCCTATT TACTAGACAG TGTTAAAAAT AGTTTTGAAA TACGCAACGG TGAAAAGTCA 

GTCGATGAAT TAGGGATTTC AGCCCCGAAT GACAAAGAAT TCATTGTTGA ATTAAAACAG 

GCCCAACCTT CCTTCTTAGC AGTCGTTTCG ATTGCTTGGT TAGCGCCACA AAATCAAAAA 

TTTGTCGAAG CGCAAGGCAA AGATTACGCC TTGGATAGTG AACATTTACT TTATAGCGGG 

CCATTTACGC TAGCCAATTG GGATGCGACT TCAGATACTT GGACATTGAA AAAAAATCCA 

GAATACTATG ATGCGGATCA AGTGAAACTG GAAGAAGTTG CGGTTAGCAC AATCAAAGAA 

GATAATACTG GGATTAACTT ATATCAAGTG AATGAACTAG ACTTAGTTCG CATTAACGGA 

CAATATGTTC AACAATATCA AGATGATCCA GGCTATGTCA GTCATCCAGA TGTGGCCAAC 
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TACTTCTTAG ATTTCAACAA AAAAGAAGGA ACGCCATTAG CGAATGTTCA TTTACGAAAA 
GCGATTGGCC AAGCAATTGA TAAAGAAGCC TTAACACAAA GTGTCTTAAA CGATGGGTCA 
AAACCCCTTA ACGGATTGAT TCCAAGTAAA CTTTATGCGA ATCCAGAAAC GGATGAAGAT 
TTCCGAGCTT ACAGTGGCGA ATATTTGAAA AATGACGTCA AAAAAGCTCA AGCTGAATGG 
ACGAAAGCCC AAGCGGATGT CGGTAAAAAA GTGAAACTTT CATTGCTGGC GGCAGACACA 
GATCAAGGAA AACGAATTGC TGAATATGTT CAAAGTCAGT TGCAAGAAAA TCTGCCAGGT 
TTAGAAATTA CCATTTCATC GCAACCAAGT AATAATGTGA ACCAATCGCG ACGTGAAAAA 
AATTATGAGT TGTCTCTTTC AGGATGGATT GCCGGCAGTA GTGAATTAGA CTCTTACTTT 
AACTTATATG CAGGAGAATC AAGTTACAAT TACGGCAATT ATCATAATGC CAAATACGAC 
CAATTGGTAG AAGAGGCACG AACGATTAAT GCCAATAATC CAGAGAAACA GTTTGCAGAA 
TACAAAGAAG CGGAAGACAT CTTGTTGAAC CAAGATGCTG CCCAAGTACC GCTGTATCAA 
AGTGCCTCAA ATTATCTAAT CAATCCTAAA TTGAAAGGCA TTAGTTATCA CTTGTATGGG 
GATTATTTCC ACTTGCGCAA TGCCTATTTA ACAGAA 



EF012-4 (SEQ ID NO:44) 

CGGTKEA AEKVDSGNLA AEQKISISSP APISTLDTTQ 
TTDKNTFTMA QHLFEGLYRF DDDSATVPAL AKDVKISDDG 
AQDFVYSWKK LVTPATIGPN AYLLDSVKNS FEIRNGEKSV 
QPSFLAWSI AWLAPQNQKF VEAQGKDYAL DSEHLLYSGP 
YYDADQVKLE EVAVSTIKED NTGINLYQVN ELDLVRINGQ 
FLDFNKKEGT PLANVHLRKA I GQ A I DKEAL TQSVLNDGSK 
RAYSGEYLKN DVKKAQAEWT KAQADVGKKV KLSLLAADTD 
EITISSQPSN NVNQSRREKN YELSLSGWIA GSSELDSYFN 
LVEEARTINA NNPEKQFAEY KEAEDILLNQ DAAQVPLYQS 
YFHLRNAYLT E 



EF013-1 (SEQ ID NO:45) 

TAACGAAAAA TGAAAAAAAT TGCTTTGTTC AGTATGTTAA CGTTCAGTGT ATTGTCTTTA 
AGTCTAGCAG G ATG TGG AAA CAAAAAAACA GCAAGCACAA ATGATTCTAA GCCAAAGCAA 
GAAACAAAGA AAGCCACGCA GAAATCCTCT AGCCAACAAG AAATGAAAAG TAGTCATTCG 
TCTGTCACGG GTCAAAATTC TAATGTGACA GGGGAAAATC CGTCAGAAAA TGCCACGCAG 
CCTTCTGCAG GAACTGATGA AACGAATGAA GTCCCTCAAA ACCAAGCACC TGATACAAAC 
ATTACAATTA CCAATGTTGT TTTCAATCCT GAAAGAAATG AAATTAATGG TACTACATTA 
CCTAATGCAA CCATTACAGC AACGGTAGTC GGTGATGCTT CTGCACAAGC AGGTGTTTTT 
TATGCGGATG CCAATGGCAA TTTTACAGTA ATTAGTCCCA GAGCGGGAGC GACTACTCAA 
TTAATCGCAA CCGTTGATCA ACGGAATAGT GCACCTGTCC AAATTGATAT TCCAAGTTCA 
GGACAAGAAG CAGCGCTTTC TTTTAGCAAT ATTACGATTG ATCCGAAACA AGGGACAATT 
TCTGGTAAAA CAGCACCGAA TGCAACTATT TTAGTGTCAC GTGCAGATGA TGCGCGGGTG 
ATTTTAGCAA GTTTTACTGC GGATGCCCAA GGGAATTTCA CAGCCAGTAA TTTAGTTCCC 
GGCACAAAAA ATCGCTTAGA TGTTACGTTA AATGGAGAAA TAGGGACACC TTACTTGTTT 
GATTTACCAA ATTAA 

EF013-2 ( SEQ ID NO:46) 

MKKIALFS MLTFSVLSLS LAGCGNKKTA STNDSKPKQE TKKATQKSSS QQEMKSSHSS 
VTGQNSNVTG ENPSENATQP SAGTDETNEV PQNQAPDTNI TITNWFNPE RNEINGTTLP 
NATITATWG DASAQAGVFY ADANGNFTVI SPRAGATTQL IATVDQRNSA PVQIDIPSSG 
QEAALSFSNI TIDPKQGTIS GKTAPNATIL VSRADDARVI LASFTADAQG NFTASNLVPG 
TKNRLDVTLN GEIGTPYLFD LPN 



RKYHFTLREG 
DELGISAPND 
FTLANWDATS 
YVQQYQDDPG 
PLNGLIPSKL 
QGKRIAEYVQ 
LYAGESSYNY 
ASNYLINPKL 



IKWSNGEPIT 
KEFIVELKQA 
DTWTLKKNPE 
YVSHPDVANY 
YANPETDEDF 
SQLQENLPGL 
GNYHNAKYDQ 
KGISYHLYGD 
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Er 013-2 ( SEQ ID NO: 47 1 

ATGTGGAAA C AAAAAAA C A GCAAGCACAA ATGATTCTAA GCCAAAGCAA 
GAAACAAAGA AAGCCACGCA GAAATCCTCT AGCCAACAAG AAATGAAAAG TAGTCATTCG 
TCTGTCAGGG GTCAAAATTC TAATGTGACA GGGGAAAATC CG TCAGAAAA TGCCACGCAG 
CCTTCTGCAG GAACTGATGA AAC3AATGAA GTCCCTCAAA ACCAAGCACC TGATACAAAC 
ATTACAATTA CCAATGTTGT TTTTAATCCT G AAA G AAA T 1 3 AAATTAATGG TACTACATTA 
CCTAATGCAA CCATTACAGC AAC GGTAGTC GGTGATGCTT CTGCACAAGC AGGTGTTTTT 
TATGCGGATG CCAATGGCAA TTT7ACAGTA ATTAGTCCCA GAGCGGGAGC GACTACTCAA 
TTAATCGCAA CCGTTGATCA ACG 2AATAGT GCACCTGTCr AAATTGATAT TCCAAGTTCA 
GGACAAGAAG CAGCGCTTTC TTTTAGCAAT ATTACGATTG ATCCGAAACA AGGGACAATT 
TCTGGTAAAA CAGCACCGAA TGCAACTATT TTAGTGTCAC GTGCAGATGA TGCGCGGGTG 
ATTTTAGCAA GTTTTACTGC GGATGCCCAA GGGAATTTCA CAGCCAGTAA TTTAGTTCCC 
GGCACAAAAA ATCGCTTAGA TGTTACGTTA AATGGAGAAA TAGGGACACC TTACTTGTTT 
GATTTACCAA AT 

EF013-4 { SEQ ID NO:48) 

CGNKKTA STNDSKPKQE TKKATQKSSS QQEMKSSHSS 

VTGQNSNVTG ENPSENATQP SAGTDETNEV PQNQAPDTNI TITNWFNPE RNEINGTTLP 
NATITATWG DASAQAGVFY ADANGNFTVI SPRAGATTQL IATVDQRNSA PVQIDIPSSG 
QEAALSFSNI TIDPKQGTIS GKTAPNATIL VSRADDARVI LASFTADAQG NFTASNLVPG 
TKNRLDVTLN GEIGTPYLFD LPN 

EF014-1 (SEQ ID NO:49) 

TGATGGTGGA GACTTTTTAA GAGAGAGGAA GTACAGCCAA TGAGTAGGAA GCGAAAAATC 
AGCTTAATTA GTTTAGTCAT CATTTTGGTT TTTGTCACAG TCGGCTCAGC ATACTTTGCT 
GTAGCGGGTA GCTATTTAAA GAAAACAATT GATAAAGGCT ATGTTCCCAT AAAAAATGAT 
TATAATGAAG CGCAAAATAA AGATAGTCAA TCGTTTTTGA TTATGGGGCT AGACAATACA 
ATTGAACGGA AATTAGGCAC AACTAGGACT GATGCTATGA TGG TGATTAC CGTGAATAAC 
AAGACGAAGA AAATAACCTA TTTAAGTTTG CCACGGGATA GTTTTGTTCA AATTGATGCG 
AAAAATTACC AAGGGATGCA GCGAATTGAA GCCGCCTATA CCTACGATGG ACCAACAGCT 
TCTGTTAACA CAGTTGAGAA ATTATTGAAT ATTCCAATCA ATCATTACGT TGTGTTTAAC 
TTTTTATCTT TTATTAAGTT AATTGATGCG GTTGGCGGCA TAGATGTCAA TGTCAAGCAG 
GCGTTTGATG GTGTCACCAA AGACGGGCCA GGATCCATTC ATTTTGATGC AGGGAAACAG 
CATTTAGATG GTACGAAAGC TTTATCTTAT GCCCGTGAAA GACATAGCGA TAACGATATT 
ATGCGTGGAT TCCGACAACA AGAAATTATT CAAGCAGTTG AAGACAAGTT GAAATCTGGT 
CAATCAATCA TGAAAATAAT GGACATTATT GATTCGTTAA ATGGAAACAT TCAAACTGAT 
GTGGATTCCA ATGAATTGAC TCATTTAGTC AAAGAAGGTT TGACTTGGAC CAATTATGAT 
AAACAACAGC TTTCTTTTGA CTGGCGCACT TTTAGTAATG AAGGGCGCAG TATGGTTGAA 
CTATACCCAG ATAGTATTGA AAATGTCCGT CATCAATTAC GTGTGTCTTT AAATTTAGAA 
AAGCCAGATG AACGAGATCA AGACGGCTAT GTCTTCCATA CGAACGGTGA ATTTTTATAT 
CAAAGTGATT ATACCGTTCA AGATGAAGCA GCTGAGGAAA ACGAAATGAC TTC CATC AAC 
GGCAATACGT ATATTGGTGT TCCTGGTAAT ACACAGACCG GCCCGTTGCC ATCAGTTAAA 
ACGGAAAATG GCTTTATAAA ATAA 

EF014-2 (SEQ ID NO:50) 

MSRKRKIS LISLVIILVF VTVGSAYFAV AGSYLKKTID KGYVPIKNDY 
NEAQNKDSQS FLIMGLDNTI ERKLGTTRTD AMMVITVNNK TKKITYLSLP RDSFVQIDAK 
NYQGMQRIEA AYTYDGPTAS VNTVEKLLNI PINHYWFNF LSFIKLIDAV GG I DVNVKQA 
FDGVTKDGPG SIHFDAGKQH LDGTKALSYA RERHSDNDIM RGFRQQEIIQ AVEDKLKSGQ 
SIMKIMDIID SLNGNIQTDV DSNELTHLVK EGLTWTNYDK QQLSFDWRTF SNEGRSMVEL 
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VPDSIENVRH QLRVSLNLEK PDERDQDGYV FHTNGEFLYQ SDYTVQDEAA EENEMTSING 
NTYIGVPGNT QTGPLPSVKT ENGFIK 

EF014-3 ( SEQ ID NO:51) 

TGCT 

GTAGCGGGTA GCTATTTAAA GAAAACAATT GATAAAGGCT ATGTTCCCAT AAAAAATGAT 

TATAATGAAG CGCAAAATAA AGATAGTCAA TCGTTTTTGA TTATGGGGCT AGACAATACA 

ATTGAACGGA AATTAGGCAC AACTAGGACT GATGCTATGA TGGTGATTAC CGTGAATAAC 

AAGACGAAGA AAATAACCTA TTTAAG TTTG CCACGGGATA GTTTTGTTCA AATTGATGCG 

AAAAATTACC AAGGGATGCA GCG AATTGAA GCCGCCTATA CCTACGATGG ACCAACAGCT 

TCTGTTAACA CAGTTGAGAA ATTATTGAAT ATTCCAATCA ATCATTACGT TGTGTTTAAC 

TTTTTATCTT TTATTAAGTT AATTGATGCG GTTGGCGGCA TAGATGTCAA TGTCAAGCAG 

GCGTTTGATG GTGTCACCAA AGACGGGCCA GGATCCATTC ATTTTGATGC AGGGAAACAG 

CATTTAGATG GTACGAAAGC TTTATCTTAT GCCCGTGAAA GACATAGCGA TAACGATATT 

ATGCGTGGAT TCCGACAACA AGAAATTATT CAAGCAGTTG AAGACAAGTT GAAATCTGGT 

CAATCAATCA TGAAAATAAT GG AC ATT ATT GATTCGTTAA ATGGAAACAT TCAAACTGAT 

GTGGATTCCA A TG AATTG AC TCATTTAGTC AAAGAAGGTT TGACTTGGAC CAATTATGAT 

AAACAACAGC TTTCTTTTGA CTGGCGCACT TTTAGTAATG AAGGGCGCAG TATGGTTGAA 

CTATACCCAG ATAGTATTGA AAATGTCCGT CATCAATTAC GTGTGTCTTT AAATTTAGAA 

AAGCCAGATG AACGAGATCA AGACGGCTAT GTCTTCCATA CGAACGGTGA ATTTTTATAT 

CAAAGTGATT ATACCGTTCA AGATGAAGCA GCTGAGGAAA ACGAAATGAC TTCCATCAAC 

GGCAATACGT ATATTGGTGT TCCTGGTAAT ACACAGACCG GCCCGTTGCC ATCAGTTAAA 
ACGGAAAATG GCTTTATAAA A 



EF014-4 { SEQ ID NO:52) 
AV AGSYLKKTID KGYVPIKNDY 

NEAQNKDSQS FLIMGLDNTI ERKLGTTRTD AMMVITVNNK TKKITYLSLP RDSFVQIDAK 
NYQGMQRIEA AYTYDGPTAS VNTVEKLLNI PINHYWFNF LSFIKLIDAV GGIDVNVKQA 
FDGVTKDGPG S IHFDAGKQH LDGTKALSYA RERHSDNDIM RGFRQQEIIQ AVEDKLKSGQ 
SIMKIMDIID SLNGNIQTDV DSNELTHLVK EGLTWTNYDK QQLSFDWRTF SNEGRSMVEL 
YPDSIENVRH QLRVSLNLEK PDERDQDGYV FHTNGEFLYQ SDYTVQDEAA EENEMTSING 
NTYIGVPGNT QTGPLPSVKT ENGFIK 

EF015-1 (SEQ ID NO:53) 

TAATTAAAAA TGTGTAAAAA GGGTCTGATG AAAAAAGGAG ACATAATAGT TATTATCTTT 
TTAATAGCTA TCTCTTTTTC TCCATATTTT ATTTTTTTTC ACAATAATCC ATTTAACTCC 
AAAAGTTTTG ACGACACTAA ATATGCTGTG GTCAAGATAG ATGGGAAAGA GATTGAGCGT 
ATAAATTTAG ATGATTCAAA AGAATTTATC AAAACATATT ATC CATC AAA AGGGCAATAT 
AATACTATAG AAGTTAAAAA TGGGCACGTT CGTGTAAAAA AAGATAATAG TCCAGATCAA 
ATTGCGGTGA AAACAGGATG GATATCAGAA CCAGGGCNAA CTAGTATCTG TATTCCTCAC 
AGATTCATTT TAGAAATTGT TCAACAATAT TCTAAGGATT ATTATATTTA CTAA 

EF015-2 (SEQ ID NO:54) 

MK KGDIIVIIFL IAISFSPYFI FFHNNPFNSK SFDDTKYAW KIDGKEIERI 
NLDDSKEFIK TYYPSKGQYN TIEVKNGHVR VKKDNSPDQI AVKTGWISEP GXTSICIPHR 
FILEIVQQYS KDYYIY 

EF015-3 (SEQ ID NO:55) 
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CAATAATCC AT77AACTCC 

AAAA G T TTT i - : ACGA3AC7AA ATA7GC7G7G G7CAAGA7AG ATGGGAAAGA GATT3AGCGT 
A7AAATTTAG A 7G A TTC AAA AGAAT77A7C AAAA C A 7 A 77 ATCCATCAAA AGGG CAATAT 
AA7ACTATAG AAGT7AAAAA TGGGCACGTT C G TG 7 AAAAA AAGATAATAG TCCAGATCAA 
AT7GGGGTGA AAAGAGGA7G GA7A7CAGAA CCAGGGCNAA CTAGTATC7G TATTCCTCAC 
AGA7TCATT7 7AGAAA77G7 7GAACAA7AT 7"7AAGGA7T ATTATATTTA C 



EFC15-4 (SEQ ID NO : 5 6 ) 
NWFNSK SFDDTKYAW KIDGKEIERI 

NLDDSXEFIK TYYPSKGQYN 71 EVKNGHVR VKKDNSPDQI AVKTGWISEP GXTSICIPHR 
FILEIVQQYS KGYYIY 



EFC16-1 ( SEQ ID NCiE' 7 ) 

TGACGGTTGC CGCCGTCCAA TAGAAAGGAG 
TTGCTGGT7A TCTGTTGTAG TTTAGTCC7A 
GAAGATCAA7 GGACAGGGA7 TAACGAAGAA 
TT7G7GCCGA TGGGTTTTCA AGATAAATCA 
GCCAAAGCGG TT7TTAAAC7 TTA7GGCA7T 
ATGAAAGAAA CAGAATTACA AAA7CAAACC 
ACGAGCGAGG GG3CCGAAAA AGTTCAATTC 
CTTGTTTC7T TAAAAGAAAA AAACATTGCA 
GGGGTTCAAA ACGGCTCTTC 7GGCTATGAT 
AAATTTGT7A AAGACCAAAC ACC7ATTTTA 
TTAAAATC7G GTCGAATTGA CGGACTCCTA 
TCCGACGAAG ATAA7TTAAA AAAC T AT ACT 
TTTGCTGTGG GCGTCCGCAA ATCAGACAA7 
GAAACGTTAC GAAAAGATGG CACCCTTAG7 
GTTACAAATA ACACAAAAAT AAACTAA 



TTTATGATGA AAAAGAAATA 7TCTTTAGCC 
TTTGCAGGTT GTGGTAAAAG AAAAAGCAAC 
AAACGGA7TA TTATTGGCTT AGATGAGTCC 
GGCAAAATTG TCGGCTTTGA TGTCGACTTA 
TCCGTTGACT TCCAACCGAT TGATTGGTCT 
ATTGATCTTA TTTGGAACGG CTACACTAAA 
aca:aacctt ACATGACGAA CGACCAAGTA 
ACAGCGAGCG ACATGCAAGG CAAAATTTTA 
GGCTTCGAAA GTCAGCCTGA C G TTTTG AAA 
7A7GACGGCT TTAATGAAGC TTTCTTAGAT 
ATCGATCGCG TTTACGCCAA CTACTATCTT 
AT77CTCATG TAGGCTATGA CAATGAAGAT 
CAA7TAGTCC AAAAAATCAA TACTGCCTTT 
AAAATTTCTC AAAAATGGTT TGGAGAGGAC 



EF016-2 (SEQ ID NO:58) 

MMKKKYSLAL LVICCSLLLF AGCGKRKSNE 
VPMGFQDKSG KIVGFDVDLA KAVFKLYGIS 
SERAEKVQFT QFYMTNDQVL VSLKEKNIAT 
FVKDQTPILY DGFMEAFLDL KSGRIDGLLI 
AVGVRKSDNQ LVQKINTAFE TLRKDGTLSK 



DQWTRINEEK RII IGLDDSF 
VDFQPIDWSM KETELQNQTI DLIWNGYTKT 
ASDMQGKILG VQNGSSGYDG FESQPDVLKK 
DRVYANYYLS HEDNLKNYTI SHVGYDNEDF 
ISQKWFGEDV TNNTKIN 



EF016-3 (SEQ ID NO:59) 
AAGCAAC 

GAAGATCAAT GGACACGGAT TAACGAAGAA 
TTTGTGCCCA TGGGTTTTCA AGATAAATCA 
GCCAAAGCGG TTTTTAAACT TTATGGCATT 
ATGAAAGAAA CAGAATTACA AAATCAAACC 
ACGAGCGAGC GGGCCGAAAA AGTTCAATTC 
CTTGTTTCTT TAAAAGAAAA AAACATTGCA 
GGGGTTCAAA ACGGCTCTTC TGGCTATGAT 
AAATTTGTTA AAGACCAAAC ACCTATTTTA 
TTAAAATCTG GTCGAATTGA CGGACTCCTA 



AAAC GG ATT A TTATTGGCTT AGATGACTCC 
GGCAAAATTG TCGGCTTTGA TGTCGACTTA 
TCCGTTGACT TCCAACCGAT TGATTGGTCT 
ATTGATCTTA TTTGGAACGG CTACACTAAA 
ACACAACCTT ACATGACGAA CGACCAAGTA 
ACAGCGAGCG ACATGCAAGG CAAAATTTTA 
GGCTTCGAAA GTCAGCCTGA CGTTTTGAAA 
TATGACGGCT TTAATGAAGC TTTCTTAGAT 
ATCGATCGCG TTTACGCCAA CTACTATCTT 
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TCCCACGAAG ATAATTTAAA AAACTATACT 
TTTGCTGTGG GCGTCCGCAA ATCAGACAAT 
GAAACGTTAC GAAAAGATGG CACCCTTAGT 
GTTACAAATA ACACAAAAAT AAAC 

EF016-4 (SEQ ID NO: 60) 



ATTTCTCATG TAGGCTATGA CAATGAAGAT 
CAATTAGTCC AAAAAATCAA TACTGCCTTT 
AAAATTTCTC AAAAATGGTT TGGAGAGGAC 



SNE DQWTRINEEK RI I IGLDDSF 

VPMGFQDKSG KIVGFDVDLA KAVFKLYGIS VDFQPIDWSM KETELQNQTI DLIWNGYTKT 
SERAEKVQFT QPYMTNDQVL VSLKEKNIAT ASDMQGKILG VQNGSSGYDG FESQPDVLKK 
FVKDQTPILY DGFNEAFLDL KSGRIDGLLI DRVYANYYLS HEDNLKNYTI SHVGYDNEDF 
AVGVRKSDNQ LVQKINTAFE TLRKDGTLSK ISQKWFGEDV TNNTKIN 



EF017-1 (SEQ ID NO: 61) 

TGAGGTGTTT TTATGAAAAG GGCAACAAAG CAAAGGCTGT CTTTGGCAGC AATCATGGTT 
CTACTTCTCT CGGGCTGTGG AAGTGTTGGG AAAGAAACCA AAAAGCAAGA ACAACAGGTA 
TTACGGGTCG GGATTGATTC GGAATTATCA ACGGCAGACG TGTCGTTGGC AATGGATAAT 
ACCGCAGCAG ATGTAATGAG CCAAGTAGGG GAGGGACTTT TCTCCTTTGA CGAAAAAGGA 
GAAGCGAAAC CAGCATTGGC AACTGAAAAA GTACAGCCCT CCAATGATGG TTTAAGCTAT 
ACTTTTACGA TTCGAAAAGA TGCAAAATGG AGTAACGGCG AGCCAATCAC AG C AAATG AT 
TTTGAATACT CTTGGAAGCG CACAGTGGAC CCAAAAACAG CTTCCCCGCA AGCGTATTAC 
TTTGAAGGGT TAAAAAATTA TCGTGCTATT GTTGACGGTA GCAAATCTAA AGAAGAGTTA 
GGGGTAACAG CCATTGATGA CCATACCTTG GAAGTAGAGC TAAGCTATCC TATGAGTTAT 
TTTCAACAAT TATTGGCGGT ACCAGCTTTT TATCCTTTAA ATGAAGCATT TGTCGAAAAA 
ACGGGCAAAA ACTATGGTAC ATCAGCTGAG TCAACACTTT ACAATGGCGC CTTCACATTA 
GAAGGTTGGG ATGGCACGAA TAATACTTGG TCCTATGTGA AGAATAAAAA TTATTGGGAT 
CAAGCGAATG TTTCGCTAGA TAAGGTGGAT GTCCAAGTAG TTAAAGAAGT CAATACTGGG 
AAAAATCTTT TCGAAGGGAA AGAATTAGAT GTTGTAAAAA TTTCTGGAGA AATTGTTGCA 
CAAGAACAAG GCAATGCAGC TTTGAAAATT CGTGAAATTC CTGGAACGTA TTATATCCAA 
TTAAATACGC AAAAAGATCT TTTGGCAAAT AAGAATGCAC GTCGAGCAAT AGCATTATCA 
TTGAATTCTG AGCGTTTAGC TAAAAATGTT TTAAATGATG GCTCAAAAAA AGCACTTGGC 
TTCGTGCCAA CAGGTTTCAC TAATCAAGAA ACGCAAAAAG ATTTTGCAGA GGAATTAGGA 
GATTTAAATC CTAGTGAACC AG AAAAAG C G AAAGAGTTAT GGCAAACGGC TAAAAAAGAA 
TTAGGAATTG AAAAAGCGGA GCTAACGATT TTAAGTTCGG ATACAGAAAA TGCTAAAAAA 
ATCAGTGAGT ATGTTCAAGG AGCTTTAGCA GATAATTTAG AAAATTTAAC AGTCAATGTT 
TCACCAGTTC CTTTTAATAA TCGTTTAGAA AAAAGTCGCA GCGGAGATTT CGACATTGTG 
GTTGGTGGCT GGACGCCAGT ATATGCTGAT CCAATCGATT TCTTAAACTT ACTGCAATCA 
AAAAATTCCA ATAATTTTGG TAAATGGTCT AATAAG AC CT TTGATCAGTT GCTTCAAGAA 
GCAAACGTAA CTTATGCAAA TAAATATGAA GAACGTTGGA AAACATTACA AAAAGCGGAT 
CAATTGGTTG CGGAAGAAGC CCCCCTAGTT CCTCTTTATC AATTAACAGA AGCACGCTTA 
GTGGCCGATT CTGTCCAAAA TTTAGTCTAT GGTCCATTAG GTTCAGGCTA TTACAAATCA 
GTCTCTATCG GCGACAAGTA A 



EF017-2 (SEQ ID NO:62) 

KKRATKQ RLSLAAIMVL LLSGCGSVGK ETKKQEQQVL RVGIDSELST ADVSLAMDNT 
AADVNSQVGE GLFSFDEKGE AKPALATEKV QPSNDGLSYT FTIRKDAKWS NGEPITANDF 
EYSWKRTVDP KTASPQAYYF EGLKMYRAIV DGSKSKEELG VTAIDDHTLE VELSYPMSYF 
QQLLAVPAFY PLNEAFVEKT GKNYGTSAES TLYNGAFTLE GWDGTNNTWS YVKNKNYWDQ 
ANVSLDKVDV QWKEVNTGK NLFEGKELDV VKISGEIVAQ EQGNAALK I R EIPGTYYIQL 
NTQKDLLANK NARRAIALSL NSERLAKNVL NDGSKKALGF VPTGFTNQET QKDFAEELGD 
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LKFSEPEKAK ELW^AKKEL G I EKAELT I L. 
PVF FNNRLEK SRSGEFi; I'/V GGWTPVYAE? 
ITvTYANKYEE RWKTLQKAI)* LVAEEAFIV? 
SIGEK 

EF017-3 (SEC I" NO: 63 ) 



:;sete::akki seyvqgai^ad xlenltvnvs 
leflnleqsk nsnnfgkwsn xtfdc'llqea 

LYQLTEARLV ADSVQNLVYG F1GSGYYKSV 



CTGTGG AAGTGTTGGG AAAGAAACCA AAAAGCAAGA ACAACAGGTA 

TTAGGGGTCG GGATTGAT7C GGAATTATCA ACGGCAGACG TGTCGTTGGC AATGGATAAT 
AT2^CAGCAG ATGTAATGAG CCAAGTAGGG GAGGGACTTT TCTCCTTTGA CGAAAAAGGA 
GAA3CGAAA7 CAGCATTGGZ AACTGAAAAA GTACAGCCCT CCAATGATGG tttaagctat 
ACTTTTACGA TTCGAAAA3A TGCAAAATGG AGTAACGGCG AGCCAATCAC A3CAAATGAT 
TTTjAATACT CT^GGAAGCG CACAGTGGAC CCAAAAACAG CTTCCCCGCA A 3CGTATTAC 
TTTGAAGGGT TAAAAAATTA TCGTGCTAT7 GTTGACGGTA GCAAATCTAA AGAAGAGTTA 
GGG5TAACAG CCATTGATGA CCATACCTTG GAAGTAGAGC TAAGCTATCC TATGAGTTAT 
TTTCAACAAT TATTGGCGGT ACCAGCTTTT TATCCTTTAA ATGAAGCATT TGTCGAAAAA 
ACGGGCAAAA ACTATGGTAC ATCAGCTGAG TCAACACTTT ACAATGGCGC CTTCACATTA 
GAAGGTTGGG ATGGCACGAA TAATACTTGG TCCTATGTGA AGAATAAAAA TTATTGGGAT 
CAAGCGAATG TTTGGCTAGA TAAGGTGGAT GTCCAAGTAG TTAAAGAAGT CAATACTGGG 
AAAAATCTTT TCGAAGGGAA AGAATTAGAT GTTGTAAAAA TTTCTGGAGA AATTGTTGCA 
CAAGAACAAG GCAATGCAGC TTTG AAAATT CGTGAAATTC CTGGAACGTA TTATATCCAA 
TTAAATACGC AAAAAGATCT TTTGGCAAAT AAGAATGCAC GTCGAGCAAT AGCATTATCA 
TTGAATTCTG AGCGTTTAGC TAAAAATGTT TTAAATGATG GCTCAAAAAA AGCACTTGGC 
TTCGTGCCAA CAGGTTTCAC TAATCAAGAA ACGCAAAAAG ATTTTGCAGA GGAATTAGGA 
GATTTAAATC CTAGTGAACC AGAAAAAGCG AAAGAGTTAT GGCAAACGGC TAAAAAAGAA 
TTAGGAATTG AAAAAGCGGA GCTAACGATT TTAAGTTCGG ATACAGAAAA TGCTAAAAAA 
ATCAGTGAGT ATGTTCAAGG AGCTTTAGCA GATAATTTAG AAAATTTAAC AGTGAATGTT 
TCA^CAGTTC CTTTTAATAA TCGTTTAGAA AAAAGTCGCA 3CGGAGATTT CGACATTGTG 
GTTGGTGGCT GGACGCCAGT ATATGCTGAT CCAATCGATT TCTTAAACTT ACTGCAATCA 
AAAAATTCCA ATAATTTTGG TAAATGGTCT AATAAGACCT TTGATCAGTT GCTTCAAGAA 
GCAAACGTAA CTTATGCAAA TAAATATGAA GAACGTTGGA AAACATTACA AAAAGCGGAT 
CAATTGGTTG CGGAAGAAGC CCCCCTAGTT CCTCTTTATC AATTAACAGA AGCACGCTTA 
GTGGCCGATT CTGTCCAAAA TTTAGTCTAT GGTCCATTAG GTTCAGGCTA TTACAAATCA 
GTCTCTATCG GGGACAAG 



EF017-4 (SEQ ID NO:64) 

CGSVGK ETKKQEQQVL RVGIDSELST ADVSLAMDNT 

AADVMSQVGE GLFSFDEKGE AKPALATEKV QPSNDGLSYT FTIRKDAKWS NGEPITANDF 

EYSWKRTVDP KTAEPQAYYF EGLKNYRAIV DGSKSKEELG VTAIDDHTLE VELSYPMSYF 

C'C'LLAVPAFY PLNEAFVEKT GKNYGTSAES TLYNGAFTLE GWDGTNNTWS YVKNKNTYWDQ 

ANVSLDKVDV QWKEVNTGK NLFEGKELDV VKISGEIVAQ EQGNAALKIR EIPGTYYIQL 

NTQKDLLANK NARRAIALSL NSERLAKNVL NDGSKKALGF VPTGFTNQET QKDFAEELGD 

LNPSEPEKAK ELVJQTAKKEL GIEKAELTIL SSDTENAKKI SEYVQGALAD NLENLTVNVS 

PVPFNNRLEK SRSGDFDIW GGWTPVYADP IDFLNLLQSK NSNNFGKWSN KTFDQLLQEA 

NVTYANKYEE RWKTLQKADQ LVAEEAPLVP LYQLTEARLV ADSVQNLVYG PLGSGYYKSV 
SIGDK 



EF018-1 (SEQ ID NO:65) 

TGTCATTACA ACGATACCAA TTTTAATCAT 
CGGTATGATG GCCGGTGCAG TAAAAGAATA 



TTATCCATTA CTACAAAAAC ACTTTATCGG 
AAGAAAGTAG GGAACAATAT GAAAAAAGTT 
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TTAGGCGGTT TATTGGTGGC AACGGCGGTC GTTAGTTTAG CGGCCTGTAG CGGTGGGGAA 

AAGAAAGCTA GCTCAGATGT CTCAATTAAG GATCGGTATG AATTAGATGA AAAGACGCCT 

GCTTGGAAGT T AG AT AA G AA GAAAGAACCG ACCAAGATTA AATGGTATAT TAACTCAGAT 

TGGACGGCGC TGCCTTTTGG AAAAGACGTG ACCACTGCGC AGATTAAAAA AGACTTAAAT 

GTGGATATTG AATTTATTTC CGGCGATGAT TCAAAATTAA ATGCCATGAT TTCAAGTGGA 

GATATGCCTG ATATCGTGAC ATTAACTGAA AAAACTGGAC AAGCAGCATT GAAAGCAGAT 

TCTTGGGCCT A TTC TTT AAA CGATTTAGCT AAAAAATATG ACCCCTATTT AATGAAAGTT 

GTTAACCAAG ATACGTTTAA ATGGTATGCC TTAGAGGATG GAAAAACATA TGGTTACCCT 

AATTACTCTA ATACAAAAGC GGATTATGAA AGTGGAAATA TCCCAGTAAA TG AT AATTTT 

GTTATTCGTG AAGATGTCTA TAATGCATTA GGCAAGCCAG ACGTTTCAAC ACCAGAAAAT 

TTTGAAAAAG TCATGCAACA GATTAAAGAA AAATATCCTG AGATGACCCC AATGGGCTTC 

ACCACAGTGG GCGATGGTGC AGGACCATTT TTAGACAAAT TACAAGACTT CTTAGGTGTT 

CCTTTAGAGG ATAAAAATGG TAAATACTAT GATCGAAATT TAGATAAAGA ATATTTAGAA 

TGGTTAAAAA CATTTAATGA TGTTTACCGA GCAGGCAATA TTAGTGATGA TAGCTTCACA 

GATGATGGGG CAACGTTTGA TGAAAAAGTG AAACAAGGAA ATTATGCAAC CATGCTCGTT 

GCTGGAACCA GTGGTCAAGG TGGGAACTTC ACAGAATTTA TGAAAAAATC TGGCACACGT 

TATATAGCCA TTGATGGACC AAGTAGCACT TCTGGCCGAA AACCAACATT AAATCAAACC 

GGCATTTCAG GTTGGTTAAG TAATTACATT ACGAAAGATG CGAAAGATCC AGCAAAAGTC 

ACTCAACTGT TCACATATTT AATTGATGAA CCGGGACAAA TTTTAACAAA ATATGGCGTT 

GAAGGAGTTA CTTATGCGTA CAATGATCAA GGAAAAATTG ATTATTTACC AGAAGTGAAA 

AAATTAGAAC AAACAGACAA TGATGCCTAC AACAAAAAAT ATGGCATTAG TCGTTTCCTA 

TACTTTAACA ACGACCGTGT CAATAAACTA AAAGTACCAA TGGAAAGTGC TTTAACGCAA 

ATGCAAGAAT GGGGCAAAGG AAAATTAGTC CCACATTTCG TAATTGAAAA TATTAATCCA 

GATGCAGGAA CGCCGGAAGC TCGTGCGAAT GAAGCGATTG AAACCAAACT AAATACAACC 

GTTATTTCAA TGATTCGTGC GAAAGATGAT AAAGCCTTTG ACAAATCTTT AGAAGACTAC 

AAAGCATTCT TAAAATCAAA TAAATGGGAT GCAATTGAAA AAATAAAATC TGAGAAAATG 
GCGGAAAACA GAGACAAACT TAAGTAA 



EF018-2 (SEQ ID NO: 66) 

MKKV LGGLLVATAV VSLAACSGGE 
KKASSDVSIK DRYELDEKTP AWKLDKKKEP 
VDIEFISGDD SKLNAMISSG DMPDIVTLTE 
VNQDTFKWYA LEDGKTYGYP NYSNTKADYE 
FEKVMQQIKE KYPEMTPMGF TTVGDGAGPF 
WLKTFNDVYR AGNISDDSFT DDGATFDEKV 
YIAIDGPSST SGRKPTLNQT GISGWLSNYI 
EGVTYAYNDQ GKIDYLPEVK KLEQTDNDAY 
MQEWGKGKLV PHFVIENINP DAGTPEARAN 
KAFLKSNKWD AIEKIKSEKM AENRDKLK 

EF018-3 (SEQ ID NO: 67) 



TKIKWYINSD WTALPFGKDV TTAQIKKDLN 
KTGQAALKAD SWAYS LNDLA KKYDPYLMKV 
SGNIPVNDNF VIREDVYNAL GKPDVSTPEN 
LDKLQDFLGV PLEDKNGKYY DRNLDKEYLE 
KQGNYATMLV AGTSGQGGNF TEFMKKSGTR 
TKDAKDPAKV TQLFTYLIDE PGQILTKYGV 
NKKYGISRFL YFNNDRVNKL KVPMESALTQ 
EAIETKLNTT VISMIRAKDD KAFDKSLEDY 



CTGTAG CGGTGGGGAA 

AAGAAAGCTA GCTCAGATGT CTCAATTAAG GATCGGTATG AATTAGATGA AAAGACGCCT 
GCTTGGAAGT TAGATAAGAA GAAAGAACCG ACCAAGATTA AATGGTATAT TAACTCAGAT 
TGGACGGCGC TGCCTTTTGG AAAAGACGTG ACCACTGCGC AGATTAAAAA AGACTTAAAT 
GTGGATATTG AATTTATTTC CGGCGATGAT TCAAAATTAA ATGCCATGAT TTCAAGTGGA 
GATATGCCTG ATATCGTGAC ATTAACTGAA AAAACTGGAC AAGCAGCATT GAAAGCAGAT 
TCTTGGGCCT ATTCTTTAAA CGATTTAGCT AAAAAATATG ACCCCTATTT AATGAAAGTT 
GTTAACCAAG ATACGTTTAA ATGGTATGCC TTAGAGGATG GAAAAACATA TGGTTACCCT 
AATTACTCTA ATACAAAAGC GGATTATGAA AGTGGAAATA TCCCAGTAAA TGATAATTTT 
GTTATTCGTG AAGATGTCTA TAATGCATTA GGCAAGCCAG ACGTTTCAAC ACCAGAAAAT 
TTTGAAAAAG TCATGCAACA GATTAAAGAA AAATATCCTG AGATGACCCC AATGGGCTTC 
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g :gatggtg: 


AGGA.CCA. 




T T A G A C AAA. T 


TACAAGAGTT 


CTTAGGTGTT 


"CTTTAGAGG 


AT A/ukAA - GG 




TAT 


G ATCGAAjaTT 


TAGATAAAGA 


ATATTTAGAA 


TG G TT AAAAA 


TA _ TTAA ^ G„~s. 


tgttta: 


GGA 


-■j- .-»G'_ Aj-iTA 


TTAGTGATGA 


TAGCTTCACA 


GATGATGGGG 


TAA CGTT » GA. 




2TG 


rt^A rAAGGAA 


ATTATGCAAC 


CATGCTCGTT 


GC TGGAACCA 


3TGGT "AAGG 


TGGGAA : 


TTC 


/iC A 3.hATTTA 


TGAAAAAATC 


TGGCACACGT 


7ATATAGCCA 


TTGATGGACC 


AAGTAGC 


ACT 


TCT 3 CGAA 


AACCAACATT 


AAATCAAACC 


GG G ATT? GAG 




T AA.TT A C 


ATT 


ACG^AAGATG 


CGAAAGATCC 


AGCAAAAGTC 


ACTCAACTGT 


TCACATATTT 


AATTGAT 


GAA 


CCGGGACAAA 


TTTTAACAAA 


ATATGGGGTT 


GAAGGAGTTA 


CTTATGCGTA 


CAATGAT 


CAA 


GGAAAAATTG 


ATTATTTACC 


AG AAG TG AAA 


AAATTAGAAC 


AAACAGACAA 


TGATGCC 


TAG 


AAG AAAAAAT 


ATGGCATTAG 


TCGTTTCCTA 


TACTTTAACA 


ACGACCGTGT 


CAATAAA 


:ta 


AAAGTACCAA 


TGGAAAGTGC 


TTTAACGCAA 


ATGCAAGAAT 


GGGGCAAAGG 


AAAATTA 


GTC 


CCACATTTCG 


TAATTGAAAA 


TATTAATCCA 


GATGCAGGAA 


CGCCGGAAGC 


TCGTGCG 


AAT 


GAAGCGATTG 


AAACCAAACT 


AAATACAACC 


GTTATTTCAA 


TGATTCGTGC 


GAAAGAT 


3AT 


AAAGrrTTTG 


ACAAATCTTT 


AGAAGACTAC 


AAAGCATTCT 


TAAAATCAAA 


TAAATGG 


3AT 


g :aattgaaa 


AAATAAAATC 


TGAGAAAATG 


GCGGAAAACA 


GAGACAAACT 


TAAG 











EF018-4 (SEQ ID NO:68) 
CSGGE 

KKASSDVSIK DRYELDEKTP AWKLDKKKEP TKIKV7YINSD WTALPFGKDV TTAQIKKDLN 

VDIEFISGDD SKLNAKISSG DMPDIVTLTE KTGQAALKAD SWAYSLNDLA KKYDPYLMKV 

VNQDTFKWYA LEDGKTYGYP NYSNTKADYE SGNI PVNDNF VIREDVYNAL GKPDVSTPEN 

FEKVMQQIKE KYPEMTPMGF TTVGDGAGPF LDKLQDFLGV PLEDKNGKYY DRNLDKEYLE 

WLKTFNDVYR AGNISDDSFT CDGATFDEKV KQGNYATMLV AGTSGQGGNF TEFMKKSGTR 

YIAIDGPSST SGRKPTLNQT GISGWLSKTYI TKDAKDPAKV TQLFTYLIDE PGQILTKYGV 

EGVTYAYNDQ GKIDYLPEVK KLEQTDNDAY NKKYGISRFL YFNNDRVNKL KVPMESALTQ 

MQEWGKGKLV PKFVIENINP DAGTPEAPAN EAIETKLNTT VISMIRAKDD KAFDKSLEDY 
KAFLKSNKWD AIEKIKSEKM AENRDKLK 



EF019-1 { SEQ ID NO:69) 

TAAAGGAGTT ACACAATGAA ACTTTTAAAA AAGACGGTCC TAATTGGTAC AACCCTTCTT 
CTTGGTTCAT TCTTACTCGC AGCTTGTGGT AATACGAATA AAGAAGCCAA CAACGCTGAC 
AAAACACATG AAGTAACAGA TACCTTAGGC AATAAAGTAA CCGTCCCCGC GAAACCCAAA 
CGGATTATTG CGAGTTATTT AGAAGATTAT CTAGTTGCAT TAGGAGAAAA ACCAGTGGCA 
CAATGGACAG TTGGACAAGG CAGCATTCAA GATTATTTAG CGAAAGAATT GAAAGATGTC 
CCCACTATTT CCTATGACTT GCCATATGAA GCGGTTCTAA AATTTGAACC TGACTTATTA 
TTAATCAGTT CATCTGCTCT AGTTGAAGGC GGTAAATACA AAGAATACAG TAAAATTGCG 
CCAACTTATG TAG TC AAAAA CGGCGAAAAT GTCACCTGGC GTGATCAATT GGAAGATATT 
GGCACTGTTT TAGATAAAAA AGAACAAGCG AAAAAAGTGT TAGAAGATTA TGATACCTTA 
ACCAAAGGCG TCCAAGAATA TCTTGGCAAA AAAGATGCTG GCAAATCTGC GGCAGTCTTA 
TGGGTAACCA ACAACCAAGT CTTTATGGTT AGCGATAATC GCTCAAGCGG AACCGTGCTC 
TATCAGGACT TAGGCCTCCA AGTTCCAAAA TTAGTGGAAG AAATTTCTAA AAACGCTACT 
GCGGATTGGA AT C AAG T TTC TTTAG AAAAA TTAGCTGAGC TTGACGCAGA CCACATTTTC 
CTTGTAAACA GCGATGAATC AGCACCTCTT TTCCAAGAAG CAATTTGGAA GAACTTACCT 
GCTGTGAAAA ATAACCAAGT TCATACCTAT GATAAAAAAA G TAG TTGG TT ATACAACGGA 
CCTATTGCGA ATACTCAAAT TGTTGAAGAT GTAAAAAAAG CGCTCTTAAA TTAA 

EF019-2 ( ( SEQ ID NO:70) 

MKLLKK TVLIGTTLLL GSFLLAACGN TNKEANNADK THEVTDTLGN KVTVPAKPKR 

I IASYLEDYL VALGEKPVAQ WTVGQGSIQD YLAKELKDVP TISYDLPYEA VLKFEPDLLL 
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ISSSALVEGG KYKEYSKIAP TYWKNGENV 

KGVQEYLGKK DAGKSAAVLW VTNNQVFMVS 

DWNQVSLEKL AELDADHIFL VNSDESAPLF 

IANTQIVEDV KKALLN 



TOR DQ LED I A TVLDKKEQAK KVLEDYDTLT 
DNRSSGTVLY QDLGLQVPKL VEEISKNATA 
QEAIWKNLPA VKNNQVHTYD KKSSWLYNGP 



EF019-3 (SEQ ID NO:71) 



TTGTGGT AATACGAATA AAGAAGCCAA CAJ 
AAAACACATG AAGTAACAGA TACCTTAGGC 
CGGATTATTG CGAGTTATTT AGAAGATTAT 
CAATGGACAG TTGGACAAGG CAGCATTCAA 
CCCACTATTT CCTATGACTT GCCATATGAA 
TTAATCAGTT CATCTGCTCT AGTTGAAGGC 
CCAACTTATG TAGTCAAAAA CGGCGAAAAT 
GCCACTGTTT TAGATAAAAA AGAACAAGCG 
ACCAAAGGCG TCCAAGAATA TCTTGGCAAA 
TGGGTAACCA ACAACCAAGT CTTTATGGTT 
TATCAGGACT TAGGCCTCCA AGTTCCAAAA 
GCGGATTGGA ATCAAGTTTC TTTAGAAAAA 
CTTGTAAACA GCGATGAATC AGCACCTCTT 
GCTGTGAAAA ATAACCAAGT TCATACCTAT 
CCTATTGCGA ATACTCAAAT TGTTGAAGAT 



AATAAAGTAA CCGTCCCCGC GAAACCCAAA 
CTAGTTGCAT TAGGAGAAAA ACCAGTGGCA 
GATTATTTAG CGAAAGAATT GAAAGATGTC 
GCGGTTCTAA AATTTGAACC TGACTTATTA 
GGTAAATACA AAGAATACAG TAAAATTGCG 
GTCACCTGGC GTGATCAATT GGAAGATATT 
AAAAAAGTGT TAGAAGATTA TGATACCTTA 
AAAGATGCTG GCAAATCTGC GGCAGTCTTA 
AGCGATAATC GCTCAAGCGG AACCGTGCTC 
TTAGTGGAAG AAATTTC TAA AAACGCTACT 
TTAGCTGAGC TTGACGCAGA CCACATTTTC 
TTCCAAGAAG CAATTTGGAA GAACTTACCT 
GATAAAAAAA GTAGTTGGTT ATACAACGGA 
GTAAAAAAAG CGCTCTTAAA T 



EF019-4 (SEQ ID NO:72) 

CGN TNKEANNADK THEVTDTLGN KVTVPAKPKR 

IIASYLEDYL VALGEKPVAQ WTVGQGSIQD YLAKELKDVP TISYDLPYEA VLKFEPDLLL 
ISSSALVEGG KYKEYSKIAP TYWKNGENV TWRDQLEDIA TVLDKKEQAK KVLEDYDTLT 
KGVQEYLGKK DAGKSAAVLW VTNNQVFMVS DNRSSGTVLY QDLGLQVPKL VEEISKNATA 
DWNQVSLEKL AELDADHIFL VNSDESAPLF QEAIWKNLPA VKNNQVHTYD KKSSWLYNGP 
IANTQIVEDV KKALLN 



EF020-1 (SEQ ID NO:73) 

TGAGGAGATG AGAAAATGAA AAAGGTAGTT TCAATTTTGT TGATGGTTGT TGCAGTCTTC 
AC ATTAAC TG CATGTAATGG TTCTAAATTA GATAAAACAG GTGAAGAATT TAAAAATTCT 
ATAATGAAAG ATTCTTCATA TGGTGATGAA TATTCAGAAG ATGGTTTTAG TTTTTTAATA 
TATAAAGATA AAGACACTAA TCGTTATTTG GCTGATGTTT GGGTTCCTGT TAAAGATGAA 
ACTAGCGCAT TGGAGTATTT TTATTATTAT GATGAAGATA AGCGATTAGA TAGTACTAAA 
AGTAAAGTAA CCTTTGATGA TATGAAAGCT AGTGGAAACT ATGAAGTAGT GTATAAATCA 
GGGAAATTTA AATAA 

EF020-2 (SEQ ID NO:74) 

MKKWS ILLMWAVFT LTACNGSKLD KTGEEFKNSI MKDSSYGDEY SEDGFSFLIY 
KDKDTNRYLA DVWVPVKDET SALEYFYYYD EDKRLDSTKS KVTFDDMKAS GNYEWYKSG 
KFK 



EF020-3 (SEQ ID NO:75) 



ATGTAATGG TTCTAAATTA GATAAAACAG GTGAAGAATT TAAAAATTCT 
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A T AA 7 3 AAA G ATTC77CA7A TGGTGATGAA 7AT7CAGAAG ATGGTTTTAG 7TTTTTAATA 

TATAAAGATA AAGACACTAA TCGTTATTTG GC7GA7GTT7 GGG7TCCTG7 T AAA G A TG AA 

A.C7AGCGCA.T ^GGAG_A.TTT TTA77ATTA.T G A T G AA G AT A AG CG ATT AG A TAGTACTAAA 

AGTAAA3TAA CCTTTGATGA 7ATG AAAG CT AGTGGAAACT ATGAAGTAGT GTATAAATCA 

GGGAAATTTA AA 

EFC2 0-4 { SEC IG NO : 7 6 ; 

CNGSKLD KTGEEFKNSI MX3SSYGDEY SEDGES EL I Y 

KDKDTNRYLA LWP^DET SALEYFYYYC EOKRLCSTKS KVTEDDMKAS GNYEWYKSG 
KFK 



EFC21-1 ( SEQ ID NO:77) 

TAGTTGTTTA AATACATTAA ACTATTTTTA GGAGGC TTTA CAGAAATGAA AAAAGCAAAA 

TTATTCGGTT TTAGTTTGAT TGCATTAGGT TTATCAGTTT CACTTGCAGC ATGTGGTGGT 

GGCAAAGGCA AAACCGCTGA AAGCGGCGGT GGCAAAGGGG ATGCAGCGCA TAGTGCTGTA 

ATCATTACAG ATACAGGCGG CGTGGATGAC AAGTCGTTCA ACCAATCTTC TTGGGAAGGA 

TTGCAAGCTT GGGGTAAAGA ACATGATTTA CCAGAAGGTT CAAAAGGGTA TGCATATATT 

CAATCGAATG ATGCAGCTGA CTATACAACC AATATTGACC AAGCGGTATC AAGTAAATTC 

AACACAATCT TTGGTATTGG CTACTTGCTA AAAGATGCAA TTTCTTCTGC AGCAGATGCC 

AACCCTGATA CAAACTTTGT TTTAATCGAT GATCAAATCG ATGGCAAAAA GAATGTCGTT 

TCTGCAACAT TTAGAGATAA TGAAGCAGCT TACTTAGCCG GTGTTGCTGC TGCAAATGAA 

AC AAAAA C G A ACAAAGTCGG TTTTGTTGGT GGTGAAGAAG GGGTCGTAAT TGACCGTTTC 

CAAGCTGGTT TTGAAAAAGG TGTGGCTGAT GCTGCGAAAG AATTAGGTAA AGAAATTACT 

GTTGATACGA AATATGCGGC TTCATTTGCT GATCCTGCCA AAGGGAAAGC TTTAGCTGCT 

GCAATGTACC AAAACGGCGT TGATATCATC TTCCATGCTT CTGGTGCGAC TGGACAAGGG 

GTCTTCCAAG AAGCAAAAGA CTTGAATGAA TCAGGTTCTG GCGACAAAGT TTGGGTAATC 

GGCGTTGACC GCGATCAAGA TGCTGATGGC AAGTACAAAA CAAAAGACGG CAAAGAAGAC 

AACTTCACGT TAACTTCAAC GCTTAAAGGT GTCGGCACAG CGGTTCAAGA TATTGCCAAC 

CGTGCGTTAG AAGACAAATT CCCTGGTGGC GAACATTTAG TTTATGGATT AAAAGATGGT 

GGCGTTGACT TAACAGACGG CTATTTAAAC GACAAAACAA AAGAAGCTGT TAAAACAGCA 

AAAGATAAAG TAATCTCAGG TGACGTAAAA GTCCCAGAAA AACCAGAATA A 



EF021-2 (SEQ ID NO:78) 

MKKAKL FGFSLIALGL SVSLAACGGG KGKTAESGGG KGDAAHSAVI 

ITDTGGVDDK SFNQSSWEGL QAWGKEHDLP EGSKGYAYIQ SNDAADYTTN IDQAVSSKFN 
TIFGIGYLLK DAI SSAADAN PDTNFVLIDD QIDGKKNWS ATFRDNEAAY LAGVAAANET 
KTNKVGFVGG EEGWIDRFQ AGFEKGVADA AKELGKEITV DTKYAASFAD PAKGKALAAA 
MYQNGVDIIF HASGATG QGV FQEAXDLNES GSGDKVWVIG VDRDQDADGK YKTKDGKEDN 
FTLTSTLKGV GTAVQDIANR ALEDKFPGGE HLVYGLKDGG VDLTDGYLND KTKEAVKTAK 
DKVISGDVKV PEKPE 



EF021-3 ( SEQ ID NO : 7 9 ) 
ATGTGGTGGT 

GGCAAAGGCA AAACCGCTGA AAGCGGCGGT GGCAAAGGGG ATGCAGCGCA TAGTGCTGTA 
ATCATTACAG ATACAGGCGG CGTGGATGAC AAGTCGTTCA ACCAATCTTC TTGGGAAGGA 
TTGCAAGCTT GGGGTAAAGA ACATGATTTA CCAGAAGGTT CAAAAGGGTA TGCATATATT 
CAATCGAATG ATGCAGCTGA CTATACAACC AATATTGACC AAGCGGTATC AAGTAAATTC 
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AACACAATCT TTGGTATTGG CTACTTGCTA AAAGATGCAA TTTCTTCTGC AGCAGATGCC 

AACCCTGATA CAAACTTTGT TTTAATCGAT GATCAAATCG ATGGCAAAAA GAATGTCGTT 

TCTGCAACAT TTAGAGATAA TGAAGCAGCT TACTTAGCCG GTGTTGCTGC TGCAAATGAA 

ACAAAAACGA ACAAAGTCGG TTTTGTTGGT GGTGAAGAAG GGGTCGTAAT TGACCGTTTC 

CAAGCTGGTT TTGAAAAAGG TGTGGCTGAT GCTGCGAAAG AATTAGGTAA AG AAATTAC T 

GTTGATACGA AATATGCGGC TTCATTTGCT GATCCTGCCA AAGGGAAAGC TTTAGCTGCT 

GC AATGTACC AAAACGGCGT TGATATCATC TTCCATGCTT CTGGTGCGAC TGGACAAGGG 

GTCTTCCAAG AAGCAAAAGA CTTGAATGAA TCAGGTTCTG GCGACAAAGT TTGGGTAATC 

GGCGTTGACC GCGATCAAGA TGCTGATGGC AAGTACAAAA CAAAAGACGG CAAAGAAGAC 

AACTTCACGT TAACTTCAAC GCTTAAAGGT GTCGGCACAG CGGTTCAAGA TATTGCCAAC 

CGTGCGTTAG AAGACAAATT CCCTGGTGGC GAACATTTAG TTTATGGATT AAAAGATGGT 

GGCGTTGACT TAACAGACGG CTATTTAAAC GACAAAACAA AAGAAGCTGT TAAAACAGCA 

AAAGATAAAG TAATCTCAGG TGACGTAAAA GTCCCAGAAA AACCAGAA 



EF021-4 (SEQ ID NO:80) 

CGGG KGKTAESGGG KGDAAHSAVI 

ITDTGGVDDK SFNQSSWEGL QAWGKEHDLP 

TIFGIGYLLK DAISSAADAN PDTNFVLIDD 

KTNKVGFVGG EEGWIDRFQ AGFEKGVADA 

MYQNGVDI IF HASGATGQGV FQEAKDLNES 

FTLTSTLKGV GTAVQDIANR ALEDKFPGGE 
DKVISGDVKV PEKPE 



EGSKGYAYIQ SNDAADYTTN IDQAVSSKFN 
QIDGKKNWS ATFRDNEAAY LAGVAAANET 
AKELGKEITV DTKYAASFAD PAKGKALAAA 
GSGDKVWVIG VDRDQDADGK YKTKDGKEDN 
HLVYGLKDGG VDLTDGYLND KTKEAVKTAK 



EF022-1 {SEQ ID NO:81) 

TAAGAGCATA AAAAAATGAA GAGTTATAGG 
ACAATGGTTT GTATTTTATT GGTAGGATTT 
AAAAAGAAAC AGAAAAATAC CAAAGAAGCC 
ACGCTCAACA CCTCTGTATT ATTGGATTTT 
GAAGGGTTAT ATAGTTTAGA TGAACAAGAC 
CCGATGATTT CAGAAGATGG AAAAACCTAC 
AGTAACGATG ATCCTGTCAC AGCACATGAT 
CCTAAAAACG GCTTTGTTTA TAGCTTCCTC 
ATCTCAGCGG GGAAATTAGC ACCCAATGAA 
TTAAAGGTGA CGCTCAAAGA GCCAAAACCG 
TTTTTCCCGC AAAATCNAAA AGTAGTCGAA 
GATAAAGTCG TCTATAATGG TCCGTTCGTG 
TGGC AAC TAG CAAAAAATAA TCGCTATTGG 
AATTATACAG TTATCAAAGA AACATCTACC 
GATGTGGCTA CACTAAGTGG TGAACTGGCG 
TCGTATCCAA CAGCGACAAT GAACTATTTG 
ACGCCGCTTG CAAACGAAAA CCTGCGTAAA 
CTAGTCAATA ATATTATTGC AGATGGTTCT 
TTTGTGGCGA ATCCCACAAC GGGTCTCGAT 
TATAACAAAG AAAAAGCGCA AAGTTATTGG 
GTTAACGTTG AATTGATGGT AACAGATGAT 
CAAGGCTCGC TACAAGAATT GTTTCCTGGT 
GAAGCTGCAT TGAACTTTGG GCGAGAAAGT 
CCAGACTATC AAGACCCTAT TTCTACCCTG 
TATCAGAACC CTGTCTATGA CAAATTATTA 
CCAGAAAAAA GATGGGCGAC ACTGATTGCA 



AGAAAGAAGA TGAAAAAGTA TTTAAAAATC 
TTAGCTGGGT GTACCAATAA AAATGAAAAT 
GTTCAACTGA TGTCACCCTC GGAATTAACA 
CCAGATGCTA TTGTCCAAAC TGCAGCGTTT 
CAATTGGTAC CAGCCGTAGC AAAAGCATTG 
ACGATTTCTT TGAGAAAAGA AGCGGTTTGG 
TTTGAATATG CTTGGAAAAA AATGATTGAT 
ATCGTTGAAA CAATTCAAAA TGGTGCAGAA 
CTAGGTGTCA CAGCTGTGGA TGATTATACA 
TACTTTACGT CCTTGTTAGC TTTTCCGACA 
CAATTTGGTG CGGACTATGG AACTGCTAGT 
GTAAAAGATT GGCAGCAAAC AAAGATGGAC 
GATCACCAGA ACGTGCGCTC AGACATTATC 
GCATTGAATC TTTTTGAAGA TGGACAATTA 
CAACAGAATA AAAATAATAC GTTGTATCAT 
CGCTTAAATC AAAAACGGNA AGGGCAAGCN 
GCATTGGCTT TAGGAATAGA TAAAGAAAAT 
AAAGCGCTAC ATGGTGCGAT TACGGAAGGC 
TTTCGTCAAG AAGCAGGTAA TTTAATGGTT 
AAAAAAGCAC AAGCAGAATT AGGAGAAAAG 
GGTTCTTACA AAAAAATTGG TGAAAGTTTG 
TTGACAATAG AGCTAACCGC ATTGCCGACT 
GACTATGATT TATTCTTAAT TTACTGGACA 
ATGACTTTAT ACAAGGGCAA TGATCGCAAT 
GATGAAGCAG CCACAACCTA TGCCTTAGAG 
GCTGAAAAAG AAGTGATTGA AACGACTGCT 
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GGCATGATTC CACT2AGCCA AAATGAACAA ACAGTCCTGC AAAATGATAA AG TC AAA GGC 
TTGAATTTTC ATACCTTTGG CGCTCCATTA ACGTTAAAAA ATGTTTATAA GGAAAAATAA 



EF222-2 ;SEQ IE NC:S2) 

XXEYEKIT MVCZLLVGFE AGCTNK NFNK XKQKKTKF^V QLMSPSELTT 

LNTSVLLDFP DAIVQTAAFE GLYSLDEQDQ LVPAVAKALP MISEDGKTYT ISLRKEAVWS 

NDDPVTAHCF EYAWKKMIDP KNGFVYSFLI VETIQNGAEI SAGKLAPNEL GVTAVDCYTL 

KVTLKEPKPY FTSLLAFPTF FPQKXK WEQ FGACYGTASE K WYNG P F W KDWQQTKMDW 

QLAKKNRYWD HQNVRSDIIN YTVIKETSTA LNLFEDGQLE 1 VATLSGELAQ QNKNNTLYHS 

YPTATMNYLF. LNQKRXGQAT PLANENLRKA LALGIDKENL VNNIIADGSK ALHGAITEGF 

VANPTTGLDF RQEAGNLMVY NKEKAQSYWK KAQAELGEKV KVELMVTDDG SYKKIGESLQ 

GSLQELFPGL TIELTALPTE AALNFGRESC YDLFLIYWTF DYQDPISTLM TLYKGNDRNY 

QKPVYDKLLD EAATTYALEP EKRWATLIAA EKEVIETTAG MIPLSQNEQT VLQNDKVKGL 
NFHTFGAPLT LKNVYKEK 



EF022-3 ( SEQ ID NO:83) 
GT GTACCAATAA AAATGAAAAT 

AAAAAGAAAC AGAAAAATAC CAAAGAAGCC GTTCAAGTGA TGTCACCCTC GGAATTAACA 
ACGCTCAACA CCTCTGTATT ATTGGATTTT CCAGATGCTA TTGTCCAAAC TGCAGCGTTT 
GAAGGGTTAT ATAGTTTAGA TGAACAAGAC CAATTGGTAC CAGCCGTAGC AAAAGCATTG 
CCGATGATTT CAGAAGATGG AAAAAC CTAC ACGATTTCTT TGAGAAAAGA AGCGGTTTGG 
AGTAACGATG ATCCTGTCAC AGCACATGAT TTTGAATATG CTTGGAAAAA AATGATTGAT 
CCTAAAAACG GCTTTGTTTA TAGCTTCCTC ATCGTTGAAA CAATTCAAAA TGGTGCAGAA 
ATCTCAGCGG GGAAATTAGC ACCCAATGAA CTAGGTGTCA CAGCTGTGGA TGATTATACA 
TTAAAGGTGA CGCTCAAAGA GCCAAAACCG TACTTTACGT CCTTGTTAGC TTTTCCGACA 
TTTTTCCCGC AAAATCNAAA AGTAGTCGAA CAATTTGGTG CGGACTATGG AACTGCTAGT 
GATAAAGTCG TCTATAATGG TCCGTTCGTG GTAAAAGATT GGCAGCAAAC AAAGATGGAC 
TGGCAACTAG CAAAAAATAA TCGCTATTGG GATCACCAGA ACGTGCGCTC AGACATTATC 
AATTATACAG TTATCAAAGA AACATCTACC GCATTGAATC TTTTTGAAGA TGGACAATTA 
GATGTGGGTA CACTAAGTGG TGAACTGGCG CAACAGAATA AAAATAATAC GTTGTATCAT 
TCGTATGCAA CAGCGACAAT GAACTATTTG CGCTTAAATC AAAAACGGNA AGGGCAAGCN 
ACGCCGCTTG CAAACGAAAA CCTGCGTAAA GCATTGGCTT TAGGAATAGA TAAAGAAAAT 
CTAGTCAATA ATATTATTGC AGATGGTTCT AAAGCGCTAC ATGGTGCGAT TACGGAAGGC 
TTTGTGGCGA ATCCCACAAC GGGTCTCGAT TTTCGTCAAG AAGCAGGTAA TTTAATGGTT 
TATAACAAAG AAAAAGCGCA AAGTTATTGG AAAAAAGCAC AAGCAGAATT AGGAGAAAAG 
GTTAACGTTG AATTGATGGT AACAGATGAT GGTTCTTACA AAAAAATTGG TGAAAGTTTG 
CAAGGCTCGC TACAAGAATT GTTTCCTGGT TTGACAATAG AGCTAACCGC ATTGCCGACT 
GAAGCTGCAT TGAACTTTGG GCGAGAAAGT GACTATGATT TATTCTTAAT TTACTGGACA 
CCAGACTATC AAGACCCTAT TTCTACCCTG ATGACTTTAT ACAAGGGCAA TGATCGCAAT 
TATCAGAACC CTGTCTATGA CAAATTATTA GATGAAGCAG CCACAACCTA TGCCTTAGAG 
CCAGAAAAAA GATGGGCGAC ACTGATTGCA GCTGAAAAAG AAGTGATTGA AACGACTGCT 
GGCATGATTC CACTTAGCCA AAATGAACAA ACAGTCCTGC AAAATGATAA AGTCAAAGGC 
TTGAATTTTC ATACCTTTGG CGCTCCATTA ACGTTAAAAA ATGTTTATAA GGAAAAA 



EF022-4 (SEQ ID NO:84) 
CTNKNENK KKQKNTKEAV QLMSPSELTT 

LNTSVLLDFP DAIVQTAAFE GLYSLDEQDQ LVPAVAKALP MISEDGKTYT ISLRKEAVWS 
NDDPVTAHDF EYAWKKMIDP KNGFVYSFLI VETIQNGAEI SAGKLAPNEL GVTAVDDYTL 
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KVTLKEPKPY FTSLLAFPTF FPQNXKWEQ FGADYGTASD KWYNGPFW KDWQQTKMDW 
QLAKNNRYWD HQNVRSDI IN YTVIKETSTA LNLFEDGQLD VATLSGELAQ QNKNNTLYHS 
YPTATMNYLR LNQKRXGQAT PLANENLRKA LALG I DKENL VNNIIADGSK ALHGAITEGF 
VANPTTGLDF RQEAGNLMVY NKEKAQSYWK KAQAELGEKV NVELMVTDDG SYKKIGESLQ 
GSLQELFPGL TIELTALPTE AALNFGRESD YDLFLIYWTP DYQDPISTLM TLYKGNDRNY 
QNPVYDKLLD EAATTYALEP EKRWATLIAA EKEVIETTAG MIPLSQNEQT VLQNDKVKGL 
NFHTFGAPLT LKNVYKEK 

EF023-1 (SEQ ID NO:85) 

TAAAATGGAG GGATCGGTAT GAAGAAATTA AAAATGTTAG GATGCGTCGG GTTGCTTTTA 
GCTTTAACGG CTTGTCAGGC GGGAACGGGA AACTCGGCTG ATAGTAACAA AGCAGCGGAA 
CAAAAAATTG CAATTAGTTC TGAAGCGGCT ATTTCGACAA TGGAACCACA CACAGCGGGG 
GATACGACCT CGACTTTAGT CATGAATCAA GTTTATGAAG GACTCTATGT TTTAGGTAAA 
GAAGATGAAT TAGAGTTGGG GGTCGCTGCC GAAGAACCAG CGATTTCTGA AGATGAAACC 
GTTTATACAT TTAAGATTAG AGAAGATGCC AAATGGTCGA ATGATGATCC AGTAACAGCA 
AACGACTTTG TTTATGCATG GCAACAAGTT GCTTCCCCTA AATCAGGATC GATTCATCAA 
GCTTTATTTT TTGATGTCAT TAAAAATGCT AAGGAAATTG CTTTAGAAGG CGCAGATGTG 
AATACTCTTG GGGTTAAGGC GCTAGATGAT AAAACGTTAG AAATAACTTT AGAACGGCCC 
ACCCCTTATT TGAAATCATT ACTTTCGTTT CCTGTTTTGT TTCCACAAAA TGAAAAATAT 
ATCAAAGAAC AAGGGGATAA ATATGCTACT GATGCAGAAC ATTTGATTTA TAATGGTCCT 
TTTAAATTGA AAGAATGGGA TAATGCCTCT TCTGATGACT GGACCTACGA AAAAAATGAT 
ACGTATTGGG ATGCTGAAAA AGTTAAATTA ACAGAAGCGA AAGTTTCAGT AATTAAGAGC 
CCAACGACAG CGGTGAATTT GTTTGACTCG AATGAATTGG ATGTAGTGAA TAAGCTAAGT 
GGTGAATTTA TTCCTGGTTA TGTTGATAAT CCAGCCTTTC TTTCAATTCC TCAATTCGTC 
ACATACTTTT TAAAAATGAA CAGCGTTCGT GATGGAAAAG AAAATCCGGC TTTAGCGAAC 
AACAATATTC GTAAAGCGTT GGCACAAGCT TTTGATAAAG AAAGTTTTGT AAAAGAAGTC 
TTGCAAGATC AATCAACGGC TACAGATCAA GTAATTCCGC CGGGACAAAC GATTGCGCCA 
GATGGAACAG ATTTCACAAA ACTAGCTGCT AAGAAAAATA ACTACTTAAC CTACGATACA 
GCGAAAGCAA AAGAATTCTG GGAAAAAGGG AAAAAAGAAA TTGGGCTGGA TAAAATCAAA 
TTAGAATTTT TAACAGATGA T A C AG A C AG C GCCAAAAAAG CTGCTGAGTT TTTCCAATTT 
CAATTGGAAG AAAATCTAGA TGGATTAGAA GTGAATGTTA CTCAAGTTCC TTTTACTATT 
CGTGTTGATC GTGATCAAAC GAGAGACTAT GATTTAGAAT TATCTGGTTG GGGAAC CGAT 
TATCGTGATC CATTAACAGT TATGCGCATC TTTACTTCGG ATAGTACCTT GGGCGGCGTA 
ACGTTCAAGA GTGATACGTA TGATCAATTA ATTCAAGAAA CTAGAACAAC ACATGCGGCT 
GATCAAGAGG CTCGTTTAAA TGACTTTGCT CAAGCACAAG ATATTTTGGT GAATCAGGAA 
ACGGTTTTAG CACCAATCTA CAATCGAAGC ATTTCTGTAT TAGCTAATCA AAAAATCAAG 
GATCTGTATT GGCATTCATT TGGACCCACG T A C AG TTTAA AATGGGCTTA TGTTAACTAA 



EF023-2 (SEQ ID NO:86) 

MKKLK MLGCVGLLLA LTACQAGTGN SADSNKAAEQ KIAISSEAAI STMEPHTAGD 
TTSTLVMNQV YEGLYVLGKE DELELGVAAE EPAISEDETV YTFKIREDAK WSNDDPVTAN 
DFVYAWQQVA SPKSGSIHQA LFFDVIKNAK EIALEGADVN TLGVKALDDK TLEITLERPT 
PYLKSLLSFP VLFPQNEKYI KEQGDKYATD AEHLIYNGPF KLKEWDNASS DDWTYEKNDT 
YWDAEKVKLT EAKVSVIKSP TTAVNLFDSN ELDWNKLSG EFIPGYVDNP AFLSIPQFVT 
YFLKMNSVRD GKENPALANN NIRKALAQAF DKESFVKEVL QDQSTATDQV IPPGQTIAPD 
GTDFTKLAAK KNNYLTYDTA KAKEFWEKGK KEIGLDKIKL EFLTDDTDSA KKAAEFFQFQ 
LEENLDGLEV NVTQVPFTIR VDRDQTRDYD LELSGWGTDY RDPLTVMRIF TSDSTLGGVT 
FKSDTYDQLI QETRTTHAAD QEARLNDFAQ AQDILVNQET VLAPIYNRSI SVLANQKIKD 
LYWHSFGPTY SLKWAYVN 
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Er 323-3 ( SEQ ID NC:&7) 

^tGGAACGGGA .mACTCGgCTG ATAG ^ A-A'_mA* ^uAb^GGA". 

CAAAAAATT3 CAATTAGTTC TGAAGCGGCT ATTTCGACAA TGGAACCACA CACAGCGGGG 
GATACGACCT CGAC77TAGT CATGAATCAA G777A7GAAG GAGTCTATGT 777AGGTAAA 
GAAGA7GAAT TAGAG TTGGG GGTCGCTGGC GAAGAACCAG CGATTTCTGA AGATGAAACC 
GTTTA7ACAT TTAAGATTAG AGAAGATGCC AAA7GGTCGA ATGATGATGC AGTAACAGCA 
AACGACTTTG TTTA7GCATG GCAACAAGT7 GCTTCCCCTA AATCAGGATG GATTCATCAA 
GCTT7ATTT7 TTGATGTCAT TAAAAATG 27 AAG G AAATTG CTTTAGAAGG CGCAGATGTG 
AATACTCTTG GGGTTAAGGC GCTAGATGA7 AAAACGTTAG AAATAACTTT AGAACGGCCC 
ACCCCTTAT7 TGAAATCATT AGTTTCGTTT ICTGTTTTGT TTCCACAAAA TGAAAAATAT 
ATCAAAGAAI AAGGGGATAA ATATGGTACT GA7G 3AGAAC ATTTGATTTA TAATGGTCCT 
TTTAAATTGA AAGAATGGGA TAATG CCTCT TCTGATGACT GGAGCTACGA AAAAAATGAT 
ACGTATTGGG ATGCTGAAAA AGTTAAATTA ACAGAAGCGA AAGTTTCAGT AATTAAGAGC 
CCAACGACAG CGGTG AATTT GTTTGACTCG AATGAATTGG ATGTAGTGAA TAAGCTAAGT 
GGTGAATTTA TTCCTGGTTA TGTTGATAAT 2CAGCCTTTC TTTCAATTCC TCAATTCGTC 
ACATACTTTT TAAAAATGAA CAGGGTTCGT GATGGAAAAG AAAATCCGGC TTTAGCGAAC 
AACAATATTC GTAAAGCGTT GGCACAAGCT TTTGATAAAG AAAGTTTTGT AAAAGAAGTC 
TTGCAAGATC AATCAACGGC 7ACAGATCAA GTAATTCCGC CGGGACAAAC GATTGCGCCA 
GATGGAACAG ATTTCACAAA ACTAGCTGCT AAGAAAAATA ACTACTTAAC GTACGATACA 
GCGAAAGCAA AAGAATTCTG GGAAAAAGGG AAAAAAGAAA TTGGGCTGGA TAAAATCAAA 
TTAGAATTTT TAACAGATGA TACAGACAGC GCCAAAAAAG CTGCTGAGTT TTTC C AATTT 
CAATTGGAAG AAAATCTAGA TGGATTAGAA GTGAATGTTA CTCAAGTTCC TTTTACTATT 
CGTGTTGATC GTGATCAAAC GAGAGACTAT GATTTAGAAT TATCTGGTTG GGGAACCGAT 
TATCGTGATC CATTAACAGT TATGCGCATC TTTACTTCGG ATAGTACCTT GGGCGGCGTA 
ACGTTCAAGA GTGATACGTA TGATCAATTA ATTCAAGAAA CTAGAACAAC ACATGCGGCT 
GATCAAGAGG CTCGTTTAAA TGACTTTGCT CAAGCACAAG ATATTTTGGT GAATCAGGAA 
ACGGTTTTAG CACCAATCTA CAATCGAAGC ATTTCTGTAT TAGCTAATCA AAAAATCAAG 
GATCTGTATT GGCATTCATT TGGACCCACG TACAGTTTAA AATGGGCTTA TGTTAAC 



EF023-4 (SEQ ID NO:88) 

GTGN SADSNKAAEQ KIAISSEAAI STMEPHTAGD 

TTSTLVMNQV YEGLYVLGKE DELELGVAAE EPAISEDETV YTFKIREDAK WSNDDPVTAN 
DFVYAWQQVA SPKSGSIHQA LFFDVIKNAK EIALEGADVN TLGVKALDDK TLEITLERPT 
PYLKSLLSFP VLFPQNEKYI KEQGDKYATD AEHLIYNGPF KLKEWDNASS DDWTYEKNDT 
YWDAEKVKLT EAKVSVIKSP TTAVNLFDSN ELDWNKLSG EFIPGYVDNP AFLSIPQFVT 
YFLKMNSVRD GKENPALANN NIRKALAQAF DKESFVKEVL QDQSTATDQV IPPGQTIAPD 
GTDFTKLAAK KNNYLTYDTA KAKEFWEKGK KEIGLDKIKL EFLTDDTDSA KKAAEFFQFQ 
LEENLDGLEV NVTQVPFTIR VDRDQTRDYD LELSGWGTDY RDPLTVMRIF TSDSTLGGVT 
FKSDTYDQLI QETRTTHAAD QEARLNDFAQ AQDILVNQET VLAPIYNRSI SVLANQKIKD 
LiYWHSFGPTY SLKWAYVN 

EF024-1 (SEQ ID NO:89) 

TAATGGCCGT TTCGTCTACT AATAAAGAGG ATGAAGCTAC TCAAATGGCG TTGGCAATGG 
AACAAGGATC ATAAAAAAGG AGAAGTGAGC ATGAAAAAAG TACTACCTTT TATTGCCTTA 
GTCGGCTTGT TATTGTTGTC AGGTTGTGGA ACAGATATGA AAAAGATATT GACTGCCGAT 
GGTGGTAAAT GGAAAGTGGA AGAAACACGT GCAACTTACA CTTTTTTTGA TGACGGTAAA 
TTTTCAGCTA ATGACTCAGA GGATAGTGTT AGTGGGACAT ACACTTATGA TGAAAAAAAT 
AAAAAAATAA CCTTTGACNT TACTAGCAGN AACTCTTTCA TTATGGAAAA AGTNGANTNC 
AANGNTANCA AGATTACAGG GGAAATTGGC GAAAAACAAA GAACACTTAT AAAACAAAAA 
ACAGAATAA 
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EF024-2 (SEQ ID NO:90) 

M KKVLPFIALA/ GLLLLSGCGT DMKKILTADG 

GKWKVEETRA TYTFFDDGKF SANDSEDSVS GTYTYDEKNK KITFDXTSXN SFIMEKVXXX 
XXKITGEIGE KQRTLIKQKT E 



EF024-3 (SEQ ID NO:91) 
ATT GACTGCCGAT 

GGTGGTAAAT GGAAAGTGGA AGAAACACGT GCAACTTACA CTTTTTTTGA TGACGGTAAA 
TTTTCAGCTA ATGACTCAGA GGATAGTGTT AGTGGGACAT ACACTTATGA TGAAAAAAAT 
AAAAAAATAA CCTTTGACNT TACTAGCAGN AACTCTTTCA TTATGGAAAA AGTNGANTNC 
AANGNTANCA AGATTACAGG GGAAATTGGC GAAAAACAAA GAACACTTAT AAAACAAAAA 
ACAGAA 



EF024-4 { SEQ ID NO:92) 



LTADG 

GKWKVEETRA TYTFFDDGKF SANDSEDSVS GTYTYDEKNK KITFDXTSXN SFIMEKVXXX 
XXKITGEIGE KQRTLIKQKT E 



EF025-1 (SEQ ID NO:93) 



TGAATGAAAC ATATTAAAGG AATGTTGGTT 
GCGCCAGATC AAGAGCCAAC GAAACAAACA 
AAGCAAGTTA CCGTCACCAA TCAAACGACT 
AATGACGAAC TGATTGCTAA TCAATTGACT 
GTTACAGGGG CC AC AC AAAC GACATTTGGA 
GAAAAAAAGA AAAAAATGTT TTGGTCCAAT 
TATTATAAAA ATGAAGGTGT ATTTACTGGC 
GAACCTGAAA CGCAAAGGAT TCTGAATGTT 
TATGATACAC GCTATTCGGG TGTCAACAAA 
AGCAACACGC GTACAGACGA TACGTTAGTC 
AAACAAATGC GTGACGAAAA TCGTGTTACA 
ACTTCTGCGC GTGAAGGATT AATGCCTTTA 
CCATCGAAAG AAACGTATAT CGGTTACGCA 
CTTCAAGTGA TAACAGAAGA GCAGAAAATA 
GATGAACAGG AAAAAATCAC AGAAACAGCC 
ATTCACCAGG ATACAATAAA CAAACCAACA 



TTTATCGGAT TATTTATTTT GGTTGGTTGT 
ACAAGTGGTC CGCAAGAGAC AAAGCAAGTG 
TCTGCGGTGG AAAAACAAGC GCCGACTAAA 
TTTGATTCTC ATGAATACAC GTACGAAGTG 
ACAACCCCAC CAGCAAAATA TACACCGGAA 
CAACCGCCTT TGGGATTAAT GACGGGTAAC 
GGAAATTACG GCATTGTAGA GATTATTACG 
GAGTTTACAG AGTTTGCTAG TGATCCTTAT 
CGCCTGTCGG ATTATCCTGA ATTTCAAGCA 
ACCGTTGTTA ATGGTATTAC TTATGTAGAA 
GGTAATTTTT ATACGGTACG CGGTTCATCA 
GCAGCAGAGA TGGACACTTG GCTAAAAGAG 
GAAGATTTAG GCAATGGCCT AATCGCTCGA 
AAACATGTCA GCTATGATGA ATACTTTTCA 
TGCGGCCTTT TTATCGTCAA TCGAAATATT 
ATTCTTTTAT TCATTTTGTA G 



EF025-2 (SEQ ID NO: 94) 

MKHIKGMLVF IGLFILVGCA PDQEPTKQTT 
DELIANQLTF DSHEYTYEW TGATQTTFGT 
YKNEGVFTGG NYGIVEIITE PETQRILNVE 
NTRTDDTLVT WNGITYVEK QMRDENRVTG 
SKETYIGYAE DLGNGLIARL QVITEEQKIK 
HQDTINKPTI LLFIL 



SGPQETKQVK QVTVTNQTTS AVEKQAPTKN 
TPPAKYTPEE KKKKMFWSNQ PPLGLMTGNY 
FTEFASDPYY DTRYSGVNKR LSDYPEFQAS 
NFYTVRGSST SAREGLMPLA AEMDTWLKEP 
HVSYDEYFSD EQEKITETAC GLFIVNRNI I 



EF025-3 (SEQ ID NO: 95) 
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AA^ ; ^.*-tAvA A>iA ~2 A /v^.kAGTG^TC LG^nAuAGAC Aj-iAG**-AAGTG 

AAGCAAGTTA CTGTCACCAA TCAAACGACT TCTGCGGTGG AAAAACAAGC GCGGACTAAA 
AATGACGAAC T3ATTGCTAA TCAATTGACT TT7GATTCTC A7GAATACAC GTACGAAGTG 
G7TACAGGGG C7ACACAAAC GACATTTGGA ACAACCCCAC CAGCAAAATA TACACCGGAA 
GAAAAAAAGA AAAAAATG 77 77GGTCCAAT CAACCGCCTT 7GGGATTAAT GACGGGTAAC 
T A T 7 A T AAAA ATGAACGTGT A777ACTGG? L-GAAATTACG GCATTGTAGA GATTATTACG 
gaa::tgaaa CG7 AAAGGA7 TCTGAATGTT GAGTTTACAG AGTTTGC7AG TGATCCTTAT 
TATGATACAC GCTATTCGGG TGTCAACAAA CGCCTG7CGG ATTATCCTGA AT7TCAAGCA 
AGCAACACGG GTACAGACGA TACGTTAGT2 ACCGTTGTTA ATGGTATTAC TTATGTAGAA 
AAAGAAATGC GTGACGAAAA TCGTGTTAGA GGTAAT77TT ATACGGTACG CGGTTCATCA 
AC7TCTGCGG GTGAAGGATT AATGCCTTTA GTAGCAGAGA TGGACACTTG GCTAAAAGAG 
CCA7CGAAAG AAACGTATAT CGGTTACG 2A GAAGATTTAG GCAATGGCCT AATCGCTCGA 
CTTCAAG7GA TAACAGAAGA GCAGAAAATA AAA:ATGTCA GCTATGATGA ATACTTTTCA 
GATGAACAGG AAAAAATCAC AGAAACAGCC TGCGGCCTTT TTATCGTCAA TCGAAATATT 
ATTCACCAGG ATACAATAAA CAAAGCAACA ATTCTTTTAT TCATTTTG 

EF025-4 (SEQ ID NC : 9 6 ) 

TKQTT SGPQETKQVK QVTVTNQTTS AVEKQAPTKN 

DELIANQLTF DSHEYTYEW TGATQTTFGT TPPAKYTPEE KKKKMFWSNQ PPLGLMTGNY 
YKNEGVFTGG NYGIVEI ITE PETQRILNVE FTEFASDPYY DTRYSGVNKR LSDYPEFQAS 
NTRTDDTLVT WNGITYVEK QMRDENRVTG NFYTVRGSST SAREGLMPL.A AEMDTWLKEP 
SKETYIGYAE DLGNGLIARL QVITEEQKIK HVSYDEYFSD EQEKITETAC GLFIVNRNII 
HQDTINKPTI LLFIL 



EF026-1 (SEQ ID NC-.97) 

TGAGTGTATG ATTACTCATT TCCCTTTGAA TCAGTTATGA TAAAGGAAGA AATAAATAAA 
TTTTTTGGAG GGATTTTCAT GAAAATGTCT AAAGTACTCA CCACTGTTTT GAGGGCAACT 
GCTGCTCTTG TGTTGCTTAG TGCTTGTTCA TCTGATAAAA AAACAGATAG TAGTTCTAGT 
AGCAAAGAAA CAGCTAATTC AAGTACAGAA GTAGTCTCTG GTGCTTCAAT TAGTGCCAAG 
CCTGAAGAGC TCGAAATGGC GTTAAGTGAT AAAGGAAATT GGATTGTCGC AGCTACTGAC 
AATGTCACTT TTGATAAAGA GGTAACAGTT GCTGGTACTT TCCATGATAA GGGGAAAGAT 
TCCAACGATG TCTATCGTAA ATTAGCACTT TATTCGCAAG ATGATAATAA AAAAGTAACT 
GCTGAATATG AAATCACGGT TCCTAAGCTA ATCGTTTCTT CTGAAAATTT CAACATCGTT 
CACGGGACTG TCAAAGGTGA TATTGAGGTG AAAGCAAATG GCTTTACTTT AAATGGTACC 
AAAG TTAATG GCAATATTAC TTTTGATAAA CAAGAATACA AAGATTCTGC TGACTTAGAA 
AAAGATGGTG CCACTGTTAC TGGTGAAGTC ACCGTAGCCA ATAATTAA 

EF026-2 ( SEQ ID NO: 98) 

MKMSK VLTTVLTATA ALVLLSACSS DKKTDSSSSS 

KETANSSTEV VSGAS1SAKP EELEMALSDK GKWIVAATDN VTFDKEVTVA GTFHDKGKDS 
NDVYRKLALY SQDDNKKVTA EYEITVPKLI VSSENFNIVH GTVKGDIEVK ANGFTLNGTK 
VNGNITFDKQ EYKDSADLEK DGATVTGEVT VANN 

EF026-3 ( SEQ ID NO: 99) 

AACAGATAG TAGTTCTAGT 

AGCAAAGAAA CAGCTAATTC AAGTACAGAA GTAGTCTCTG GTGCTTCAAT TAGTGCCAAG 
CCTGAAGAGC TCGAAATGGC GTTAAGTGAT AAAGGAAATT GGATTGTCGC AGCTACTGAC 
AATGTCACTT TTGATAAAGA GGTAACAGTT GCTGGTACTT TCCATGATAA GGGGAAAGAT 



WO 98/50554 



PCT/US98/08959 



107 

TABLE 1. Nucleotide and Amino Acid Seqeuences of E, faecalis Genes. 

TCCAACGATG TCTATCGTAA ATTAGCACTT TATTCCCAAG ATGATAATAA AAAAGTAACT 

GCTGAATATG AAATCACGGT TCCTAAGCTA ATCGTTTCTT CTGAAAATTT CAACATCGTT 

CACGGGACTG TCAAAGGTGA TATTGAGGTG AAAGCAAATG GCTTTACTTT AAATGGTACC 

AAAGTTAATG GCAATATTAC TTTTG A T AAA CAAGAATACA AAGATTCTGC TGACTTAGAA 

AAAGATGGTG CCACTGTTAC TGGTGAAGTC ACCGTAGCCA ATAAT 

EF026-4 (SEQ ID NO:100) 

TDSSSSS 

KETANSSTEV VSGASISAKP EELEMALSDK GNWIVAATDN VTFDKEVTVA GTFHDKGKDS 
NDVYRKLALY SQDDNKKVTA EYEITVPKLI VSSENFNIVH GTVKGDIEVK ANGFTLNGTK 
VNGNITFDKQ EYKDSADLEK DGATVTGEVT VANN 



EF027-1 (SEQ ID NO:101) 

TTTGGTATGA AACAGAAAAA GTGGTTAATC GGACTTGTTG CACTGGGCTT GGTTTTAGCA 

GCATGTGGAA GTGGCGGTTC GAAAACGACC TCAAACGAAC CAGCTACACA GAAAATTAAC 

GTCGCATCTG GTGGTGAACT CTCGACATTA GACAGCGCTC ATTATACAGA TGTCTATAGT 

TCCGATATGA TTGGTCAAGT AGTTGAAGGC TTGTATCGAC AAGATAAAAA CGGAGATCCT 

GAGCTAGCTA TGGCGAAAGC AGAGCCACAA GTTAGTGAAG ACGGGTTAGT CTATACATTC 

AAGTTACGAG AAGCAAAATG GACAAACGGG GATCCAGTTA AAGCAGGGGA TTTTGTAGTT 

GCGTTTAGAA ACGTGGTCGA TCCAGCATAC GGTTCAAGTA GCAGTAATCA AATGGATATT 

TTTAAAAATG GGCGTGCGGT GCGGGAAGGA CAAGCCACGA TGGAAGAATT TGGTGTCAAA 

GCAATCGATG ACCAGACACT AGAACTAACA TTGGAAAATC CAATTCCTTA TTTAGCCCAA 

GTCTTGGTTG GGACACCTTT TATGCCTAAA AATGAAGCCT TTGCCAAAGA AAAAGGTACT 

GCCTATGGGA CTTCTGCAGA TAATTTTGTT GGCAATGGGC CGTTTGTAAT TTCAGGTTGG 

GATGGCAATT CCGAAACTTG GAAATTGAAG AAGAATGATC ATTATTGGGA TAAAGAACAC 

GTAAAATTGA ATGAAATTGA TGTTCAAGTA GTGAAAGAAA TTGGCACAGG AGCCAATCTT 

TTTGATAATG GCGACTTAGA TTACACTGTT TTAGCAGATA CTTATGCACT TCAGTATAAA 

GAGTCAAAAC AAGCGCATTT TGTACCTAAA GCCATGGTGG GTTATTTAAG CCCCAATCAT 

CGCCGTGAAA TTACCGGCAA CGAACATGTT CGAAAAGCTT TTTTACAAGC GATTGACAAA 

GAAAC TTTTG CAAAAGAAAT TTTAGGAGAT GGCTCGACAG CTTTAAATGG NTTTGTACCA 

GCTAATTTTG CAAAAATCCA GATACAGGTG AAGATTTCCG CAAAGAAAAT GGTGATTTAT 

TGCCATATAA TATTAAAGAA GCCCAAGCTA ACTGGAACAA TT 

EF027-2 (SEQ ID NO:102) 

MKQKKWLI GLVALGLVLA ACGSGGSKTT SNEPATQKIN VASGGELSTL DSAHYTDVYS 
SDMIGQWEG LYRQDKNGDP ELAMAKAEPQ VSEDGLVYTF KLREAKWTNG DPVKAGDFW 
AFRNWDPAY GSSSSNQMDI FKNGRAVREG QATMEEFGVK AIDDQTLELT LENPIPYLAQ 
VLVGTPFMPK NEAFAKEKGT AYGTSADNFV GNGPFVISGW DGNSETWKLK KNDHYWDKEH 
VKLNEIDVQV VKEIGTGANL FDNGDLDYTV LADTYALQYK ESKQAHFVPK AMVGYLSPNH 
RREITGNEHV RKAFLQAIDK ETFAKEILGD GSTALNGFVP ANFAKIQIQV KISAKKMVIY 
CHI ILKKPKL TGTI 

EF027-3 (SEQ ID NO:103) 

AACGACC TCAAACGAAC CAGCTACACA GAAAATTAAC 

GTCGCATCTG GTGGTGAACT CTCGACATTA GACAGCGCTC ATTATACAGA TGTCTATAGT 
TCCGATATGA TTGGTCAAGT AGTTGAAGGC TTGTATCGAC AAGATAAAAA CGGAGATCCT 
GAGCTAGCTA TGGCGAAAGC AGAGCCACAA GTTAGTGAAG ACGGGTTAGT CTATACATTC 
AAGTTACGAG AAGCAAAATG GACAAACGGG GATCCAGTTA AAGCAGGGGA TTTTGTAGTT 
GCGTTTAGAA ACGTGGTCGA TCCAGCATAC GGTTCAAGTA GCAGTAATCA AATGGATATT 
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tttaaaaat^ ggcgtg:ggt gcgggaagga caagccacga tggaagaatt tggtgtcaaa 
gcaa.tc3at3 ag cagacact agaactaaca ttggaaaatc caattcctta tttagcccaa 

GTCTTGGTTG GGACACGTTT TATGCCTAAA AATGAAGCCT TTGCGAAAGA aaaaggtact 
GCGTATGGGA CTTGTG :AGA TAAT7TTGTT GGCAATGGGC CGTTTGTAAT TTCAGGTTGG 
GATGGCAATT C rGAAATTTG GAAAT'TGAAG AAGAA7GATC ATTATTGGGA TAAAGAACAC 
GTAAAATTGA ATGAAATTGA TGTTCAAGTA GTGAAAGAAA TTGGGACAGG AGCGAATCTT 
TTTGATAATG GC3ACTTAoA TTACA2TGTT TTAGCAGATA CTTATGC ACT TGAGTATAAA 
GAGTCAAAAC AAGCGGATTT TGTACGTAAA GCCATGGTGG GTTATTTAAG GCCCAATCAT 
CGCC 3TGAAA TTACCGGGAA CGAACATGTT CGAAAAGCTT TTTTACAAGC GATTGACAAA 
GAAACTTTTG CAAAAGAAAT TTTAG3AGAT GGGTCGACAG CTTTAAATGG NTTTGTACCA 
GCTAATTTTG CAAAAA7CCA GATACAGGTG AAGATTTCCG CAAAGAAAAT GGTGATTTAT 
TGCCATATAA TATTAAAGAA GCCCAAGGTA A 

EF027-4 (SEQ ID NC : 104) 

TT SNEPATQKIN VASGGELSTL DSAKYTDVYS 

sdm:gqweg lyrqg?;ngdp ELAMAKAEPQ vsedglvytf klreakwtng dpvkagdfw 

AFRUWDPAY GSSSSNQMDI FKNGRAVREG QATMEEFGVK AIDDQTLELT LENP IPYLAQ 
VLVGTPFMPK KEAFAXEKGT AYGTSADNFV GNGPFVISGW DGNSETOKLK KNDHYWDKEH 
vkl::eidvqv VFEIGTGANL FDNGELDYTV LADTYALQYK eskqahfvpk amvgylspnh 
rreitgnehv rkaflqaidk etfakeilgd gstalngfvp anfakiqiqv kisakkmviy 
chiilkkpkl 



ef028-1 (seq id no:105) 

taacagaagc aatacaacaa cttaacactt 
aagacttgtt atagtcaatg tatgggtaga 
agagctttgc TAGGGGTTAC cttattaaca 
tctgaacaga aaagcggcga aaaacaaaca 
gaaaaaggat cagtaaaaaa tgttattttt 
acaacgggct atcgctattt caaagccaat 
tttgatacct atttggtcgg acagcaagcc 
accgattcag cttccgcagc gacagcgatg 
attgcactcg ataatgacaa gtccaaaaca 
gggaaatcaa cgggtcttgt agcaacatct 
ggcgcacata atgtttcacg caaaaatatg 
caaatcgacg gacaacacaa agtcgatgtg 
cggaaagatc gtgatttagt caaagaattt 
aaaaagtcgt taaatgagaa ccaagacgac 
ctacctaaaa tgattgaccg aacggaagaa 
gctcttcaac ggttagataa aaatgaaaaa 
attgattggg ccgggcatag caatgatatt 
gaagcggcgt ttgaaaaggc catcgatttt 
caactgcaga tcattcaaca gggggcttgt 

ef028-2 { seq id no:106) 

mkkr allgvtlltf ttlagctnls 
eqksgekqte vaeakatese kasvknvifm 
dtylvgqqat ypedeeenvt dsasaatama 
kstglvatse ithatpaayg ahnvsrknma 

KDRDLVKEFS QAGYGHVTDK KSLNENQDDK 
LQRLDKNEKG FFLMVEGSQI DWAGHSNDIV 



TGTTTACTTG TTATTTATCA GAAATCAACT 
TATGAAGGAG GAAACAAGGA AATGAAGAAA 
TTCACAACAT TAGCGGGTTG T AC AAA TTTA 
GAGGTTGCTG AAGCGAAGGC AACTGAATCT 
ATGATTGGAG ATGGCATGGG GAATCCGTAT 
CACTCAGACA AGCGTGTTCC CCAAACAGCT 
ACTTATCCAG AAGATGAAGA AGAGAATGTC 
GCTGCCGGAG TGAAAACCTA TAATAATGCT 
GAAACAGTGC TCGAACGTGC GAAAAAAGTG 
GAAATAACAC ATGCAACCCC TGCTGCATAT 
GCAGAAATCG CCGATGACTA TTTTGATGAT 
TTACTTGGCG GCGGCTCCGA ATTATTTGCC 
TCCCAAGCGG GTTATGGTCA TGTCACAGAC 
AAAATTTTAG GCTTGTTTGC ACCAGGCGGG 
GTCCCTTCAT TAGCTGATAT GACAGAAGCG 
GGTTTCTTTT TAATGGTTGA AGGTAGTCAA 
GTTGGCGCGA TGAGCGAAAT GCAAGACTTC 
GCCAAAAAAG ATGGTGAACA TTGGTGGTTA 
CTTTAG 



IGDGMGNPYT TGYRYFKANH SDKRVPQTAF 
AGVKTYNNAI ALDNDKSKTE TVLERAKKVG 
EIADDYFDDQ IDGQHKVDVL LGGGSELFAR 
ILGLFAPGGL PKMIDRTEEV PSLADMTEAA 
GAMSEMQDFE AAFEKAIDFA KKDGEHWWLQ 
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LQI IQQGACL 



EF028-3 (SEQ ID NO:107) 



ACAGA AAAGCGGCGA AAAACAAACA GAGG r 
GAAAAAGCAT CAGTAAAAAA TGTTATTTTT 
ACAACGGGCT ATCGCTATTT CAAAGCCAAT 
TTTGATACCT ATTTGGTCGG ACAGCAAGCC 
ACCGATTCAG CTTCCGCAGC GACAGCGATG 
ATTGCACTCG ATAATGACAA GTCCAAAACA 
GGGAAATCAA CGGGTCTTGT AGCAACATCT 
GGCGCACATA ATGTTTCACG CAAAAATATG 
CAAATCGACG GACAACACAA AGTCGATGTG 
CGGAAAGATC GTGATTTAGT CAAAGAATTT 
AAAAAGTCGT TAAATGAGAA CCAAGACGAC 
CTACCTAAAA TGATTGACCG AACGGAAGAA 
GCTCTTCAAC GGTTAGATAA AAATGAAAAA 
ATTGATTGGG CCGGGCATAG CAATGATATT 
GAAGCGGCGT TTGAAAAGGC CATCGATTTT 
CAACTGCAGA TCATTCAACA GGGGGCTTGT 



TGCTG AAGCGAAGGC AACTGAATCT 
ATGATTGGAG ATGGCATGGG GAATCCGTAT 
CACTCAGACA AGCGTGTTCC CCAAACAGCT 
ACTTATCCAG AAGATGAAGA AGAGAATGTC 
GCTGCCGGAG TGAAAACCTA TAATAATGCT 
GAAACAGTGC TCGAACGTGC GAAAAAAGTG 
GAAATAACAC ATGCAACCCC TGCTG CATAT 
GCAGAAATCG CCGATGACTA TTTTGATGAT 
TTACTTGGCG GCGGCTCCGA ATTATTTGCC 
TCCCAAGCGG GTTATGGTCA TGTCACAGAC 
AAAATTTTAG GCTTGTTTGC ACCAGGCGGG 
GTCCCTTCAT TAGCTGATAT GACAGAAGCG 
GGTTTCTTTT TAATGGTTGA AGGTAGTCAA 
GTTGGCGCGA TGAGCGAAAT GCAAGACTTC 
GCCAAAAAAG ATGGTGAACA TTGGTGGTTA 
CTT 



EF028-4 (SEQ ID NO:108) 



QKSGEKQTE VAEAKATESE KASVKNVIFM : 
DTYLVGQQAT YPEDEEENVT DSASAATAMA 
KSTGLVATSE ITHATPAAYG AHNVSRKNMA 
KDRDLVKEFS QAGYGHVTDK KSLNENQDDK 
LQRLDKNEKG FFLMVEGSQI DWAGHSNDIV 
LQI IQQGACL 



GDGMGNPYT TGYRYFKANH SDKRVPQTAF 
AGVKTYNNAI ALDNDKSKTE TVLERAKKVG 
EIADDYFDDQ IDGQHKVDVL LGGGSELFAR 
ILGLFAPGGL PKMIDRTEEV PSLADMTEAA 
GAMSEMQDFE AAFEKAIDFA KKDGEHWWLQ 



EF029-1 (SEQ ID NO:109) 



TGAAGGAGGG AGAAAATGAA 
GCCACTTTTT TATTATCAGG 
GAAGTTACTG AAAATGTTAC 
GTTAGGCACG TGGATGGCGA 
CGGTTTTTAT TAATTG AC AC 
GGATTGGAAG CTAGCAAACG 
GAATATGATA AGGGCGATAA 
GATGGAACAT TACTACAAAA 
AAAGAGCCTA CAACTAAGTA 
GAGTCACTCG GAATCTGGAG 



AAAGTTAATC GGTAAAAAGT 
ATGCGCAAGT CTTGAACAAA 
TCAAACTATT TCAAACGATC 
TACCACAGTA TTAAAAATTG 
ACCCGAGACT GTGAAACCGA 
CACAAAAGAG CTTTTGTCTA 
AACAGATCGT TACGGACGAG 
AACGCTTGTA AGTGAAGGAT 
TTTGGCAGAA CTAGAGCAAG 
CATACCAGGT TATGTGACAC 



GGCTGCTGCT TACAGCAGTA 
AAGCACAGGA TAGTGTAAAA 
AACGTATACC AGCTGATTTT 
ACGGAAAAGA ACAAAAAGTT 
AAACAAAAGT TCAGCCGTTC 
CTGCTTCAGA AATTACGTTT 
CGTTGGGCTA CATATTCGTA 
TAGCTCGTGT TGCCTATGTA 
CCCAAGAACA GGCTAAAAAT 
AACGGGGGTT TAGTAAATAA 



EF029-2 (SEQ ID NO: 110) 



MKKLIG KKWLLLTAVA TFLLSGCASL EQKAQDSVKE VTENVTQTIS NDQRIPADFV 
RHVDGDTTVL KIDGKEQKVR FLLIDTPETV KPKTKVQPFG LEASKRTKEL LSTASEITFE 
YDKGDKTDRY GRALGYIFVD GTLLQKTLVS EGLARVAYVK EPTTKYLAEL EQAQEQAKNE 
SLGIWSIPGY VTQRGFSK 



EF029-3 (SEQ ID NO:lll) 



AAATGTTAC TCAAACTATT TCAAACGATC AACGTATACC AGCTGATTTT 

GTTAGGCACG TGGATGGCGA TACCACAGTA TTAAAAATTG ACGGAAAAGA ACAAAAAGTT 
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ooaTTG G/iAvj TA^_ ,nA/v^ G Cm. --.rtA-nA^AG 

GAATATGATA AGGGCGATAA AACAGATCGT 

LaATGGAACA. TACTACAAAA AACGC-TGTA 

AAAGAGCCTA CAACTAAGTA TTTGGCAGAA 

GAGTCACTCG GAA.TCTGGAG CATACCAGGT 

EF029-4 (SEv ID NG: 112} 



GTGAAACCGA AAACAAAAGT TCAGCCGTTC 

:ttttgtcta CTGCTTCAGA AATTACGTTT 

TAG jGACGAG CGTTGGGCTA GATATTCGTA 

AGT SAAGGAT TAGGTCGTGT TGCCTATGTA 

:tasagcaag CCCAAGAACA GGCTAAAAAT 

TAT 3 TG AC AC AACGGGGG7T TAG T AAA 



NVTQTIS NDQF.IPADFY 

RHVDGDTTVT KIDGKEQKVR FLLIDTPETV KPKTKVQPFG LEASKRTKEL LSTASEITFE 
YDKGDKTDRY G RALGY I FVD GTLLQKTLVS EGLARVAYVK EPTTKYLAEL EQAQEQAKNE 
SLGIWSIPGY VTQRGFSK 

EF030-1 (SEQ ID NO: 113 ) 

TGATTGACAC ATAGGGGGAA TAGTATGAAA AAGTTAAAAA TGATGGGGAT TATGTTATTT 

GTTAGTACGG TCTTGGTAGG TTGTGGCACA ACAGCAGANA CAAAAATAGA CGAGAAAGCA 

ACTGAGAAAA CCAGTGTCTC GAAAAAAGTT TTAAATTTAA TGGAGAACTC GGAAATCGGT 

TCAATGGATT CTATTTTTAC ACAAGATGAA GCCAGTATTA ACGCACAGTC CAATGTCTTT 

GAAGGGTTAT ATCAATTGGA TGAAAAAGAT CAACTAATAC CTGCTGCTGC TAAAGAGATG 

C C AG AAATTT CTGAGGATGG CAAACGATAT ACCATTAAAC TAAGAGAAGA TGGCAAGTGG 

TCCAATGGTG ATGCTGTAAC AGCCAATGAT TTCGTTTTTG CTTGGCGTAA ATTAGCGAAT 

CCCAAAAACC AAGCCAATTA CTTTTTCTTG TTAGAAGGAA CGATTCTGAA CGGAACAGCT 

ATTACAAAAG AGGAAAAAGC ACCAGAGGAA TTGGGTGTCA AAGCGCTTGA TGATTATACT 

TTGGAGGTTA CTTTAGAAAA GCCTGTACCA TATTTTACGT CGTTATTGGC ATTTTCTCCA 

TTTTTCCCAC AAAACGAAGC ATTCGTGAAA GAAAAAGGAC AAGCCTATGG CACTTCTAGT 

GAAATGATTG TATCTAATGG TCCGTTTTTA ATGAAAAATT GGGATCAGTC AGCGATGTCG 

TGGGATTTTG TGCGTAATCC CTACTATTAC GATAAAGAAA AAGTAAAATC AGAAACGATT 

CATTTTGAAG TTCTTAAAGA AACCAATACC GTTTATAATT TGTACGAATC AGGTGAATTA 

GATGTGGCTG TCTTAACAGG AGATTTTGCT AAACAAAATC GAGACAACCC AGACTATGAA 

GCAATCGAAC GGTCAAAAGT CTATTCCTTA CGTTTAAACC AAAAAAGAAA CGAAAAACCA 

TCCATTTTTG CAAATGAGAA TGTCCGCAAA GCTTTAGCTT ATGCTTTGGA TAAAAAAAGT 

TTAGTCGATA ATATTTTAGC AGATGGCTCA AAAGAAATTT ATGGGTACAT TCCAGAAAAA 

TTTGTATATA ACCCAGAAAC GAATGAAGAT TTTCGTCAAG AAGCAGGCGC TCTTGTCAAA 

ACAGACGCCA AAAAAGCCAA AGAGTATTTA GATAAAGCAA AAGCAGAGCT AAACGGAGAT 
GTAGCCATTG AACTTCTTTC AAGAGATGGT GATAGTGACC GA 

EF030-2 (SEQ ID NO:114} 

MKK LKMMGIMLFV STVLVGCGTT AXTKIDEKAT EKTSVSKKVL NLMENSEIGS 
MDSIFTQDEA SINAQSNVFE GLYQLDEKDQ LIFAAAKEMP EISEDGKRYT IKLREDGKWS 
NGDAVTANDF VFAWRKLANP KNQANYFFLL EGTILNGTAI TKEEKAPEEL GVKALDDYTL 
EVTLEKPVPY FTSLLAFSPF FPQNEAFVKE KGQAYGTSSE MIVSNGPFLM KNWDQSAMSW 
DFVRNPYYYD KEKVKSETIH FEVLKETNTV YNLYESGELD VAVLTGDFAK QNRDNPDYEA 
IERSKVYSLR LNQKRNEKPS IFANENVRKA LAYALDKKSL VDNILADGSK EIYGYIPEKF 
VYNPETNEDF RQEAGALVKT DAKKAKEYLD KAKAELNGDV AIELLSRDGD SDR 

EF030-3 ( SEQ ID NO:115} 

GAGAAAGCA 

ACTGAGAAAA CCAGTGTCTC GAAAAAAGTT TTAAATTTAA TGGAGAACTC GGAAATCGGT 
TCAATGGATT CTATTTTTAC ACAAGATGAA GCCAGTATTA ACGCACAGTC CAATGTCTTT 
GAAGGGTTAT ATCAATTGGA TGAAAAAGAT CAACTAATAC CTGCTGCTGC TAAAGAGATG 
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CCAGAAATTT CTGAGGATGG CAAACGATAT ACCATTAAAC TAAGAGAAGA TGGCAAGTGG 
TCCAATGGTG ATGCTGTAAC AGCCAATGAT TTCGTTTTTG CTTGGCGTAA ATTAGCGAAT 
CCCAAAAACC AAGCCAATTA CTTTTTCTTG TTAGAAGGAA CGATTCTGAA CGGAACAGCT 
ATTACAAAAG AGGAAAAAGC ACCAGAGGAA TTGGGTGTCA AAGCGCTTGA TGATTATACT 
TTGGAGGTTA CTTTAGAAAA GCCTGTACCA TATTTTACGT CGTTATTGGC ATTTTCTCCA 
TTTTTCCCAC AAAACGAAGC ATTCGTGAAA GAAAAAGGAC AAGCCTATGG CACTTCTAGT 
GAAATGATTG TATCTAATGG TCCGTTTTTA ATGAAAAATT GGGATCAGTC AGCGATGTCG 
TGGGATTTTG TGCGTAATCC CTACTATTAC GATAAAGAAA AAGTAAAATC AGAAACGATT 
CATTTTGAAG TTCTTAAAGA AACCAATACC GTTTATAATT TGTACGAATC AGGTGAATTA 
GATGTGGCTG TCTTAACAGG AGATTTTGCT AAACAAAATC GAGACAACCC AGACTATGAA 
GCAATCGAAC GGTCAAAAGT CTATTCCTTA CGTTTAAACC AAAAAAGAAA CGAAAAACCA 
TCCATTTTTG CAAATGAGAA TGTCCGCAAA GCTTTAGCTT ATGCTTTGGA TAAAAAAAGT 
TTAGTCGATA ATATTTTAGC AGATGGCTCA AAAGAAATTT ATGGGTACAT TCCAGAAAAA 
TTTG TAT AT A ACCCAGAAAC GAATGAAGAT TTTCGTCAAG AAGCAGGCGC TCTTGTCAAA 
ACAGACGCCA AAAAAGCCAA AGAGTATTTA GATAAAGCAA AAGCAGAGCT AAACGGAGAT 
GTAGCCATTG AACTTCTTTC AAGAGATGGT 

EF030-4 (SEQ ID N0:116) 



EKAT EKTSVSKKVL NLMENSEIGS 

MDSIFTQDEA SINAQSNVFE GLYQLDEKDQ LIPAAAKEMP EISEDGKRYT IKLREDGKWS 

NGDAVTANDF VFAWRKLANP KNQANYFFLL EGTILNGTAI TKEEKAPEEL GVKALDDYTL 

EVTLEKPVPY FTSLLAFSPF FPQNEAFVKE KGQAYGTSSE MIVSNGPFLM KNWDQSAMSW 

DFVRNPYYYD KEKVKSETIH FEVLKETNTV YNLYESGELD VAVLTGDFAK QNRDNPDYEA 

IERSKVYSLR LNQKRNEKPS I FANENVRKA LAYALDKKSL VDNILADGSK EIYGYIPEKF 

VYNPETNEDF RQEAGALVKT DAKKAKEYLD KAKAELNGDV AIELLSRDG 

EF031-1 (SEQ ID NO:137) 

TGAGAAATTA GTTATTTTAG AAAAATAAAA ACCATTTTGG AGGAAGATTT AAAAATGAAA 
AAACGCGTAA TTTTAGGGAC ATTAGTCGCT GCAACGTTAT TAATGACTGC TTGTGGAAAC 
AGCGAAGCAA CTACGAAAAG CGAGAGCAAA GGTGGAAGTA ATGCTTTAGT CGTTTCAACT 
TTCGGATTAA GTGAAGATAT TGTCAAAAAA GACATTATCG CTCCATTTGA AAAAGAGAAT 
GAAGCGAAAG TTACCTTAGA AGTAGGCAAT AGCGCAGACC GCTTTACGAA ATTAAAAAAT 
AATCCCAATG CGGGAATTGA TGTCATTGAA TTAGCACAAG CAAATGCAGC ACAAGGTGGA 
AAAGATGGGT TATTTGAAAA AATTACAGAA AAAGAAGTAC CTAATTTAAG TCAGTTAACG 
CCGGGAGCAA AAGAGGTTTT TGAAAGTGGT GCTGGCGTAC CAATCGCTGT AAACAGTATC 
GGGATTGTTT ACAACAAAGA AAAATTAGGC AAAGAAATTA AAAACTGGGA TGACTTATGG 
TCAGCTGATT TGAAAGGTAA AATTTCTGTT CCAGACGTTG CCACGACGGC AGGTCCTTTA 
ATGTTATACG TTGCTAGTGA ACATGCTGGT CAAGATATTA CAAAAGATAA CGGGAAGGCC 
GCTTTTGAAG CGATGAAAGA ATTAAAACCA AACGTTGTTA AAACGTATTC AAAATCGTCA 
GACTTAGCNA ATATGTTCCA ATCTGGTGAA ATTGAAGCAG CTGTGGTTGC TGATTTTGCG 
GTTGATATTA TTCAAGGCGC ACAGAAAACG TGA 

EFO031-2 (SEQ ID NO: 118) 

MKK RVILGTLVAA TLLMTACGNS EATTKSESKG GSNALWSTF 

GLSEDIVKKD IIAPFEKENE AKVTLEVGNS ADRFTKLKNN PNAGIDVIEL AQANAAQGGK 
DGLFEKITEK EVPNLSQLTP GAKEVFESGA GVPIAVNSIG IVYNKEKLGK EIKNWDDLWS 
ADLKGKISVP DVATTAGPLM LYVASEHAGQ DITKDNGKAA FEAMKELKPN WKTYSKSSD 
LANMFQSGEI EAAWADFAV DIIQGAQKT 



EF031-3 (SEQ ID NO:119) 
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AA CTACGAAAAG CGAGAGCAAA GG7GGAAGTA ATG3TTTAGT CGTTTC AACT 



TTCGGATTAA 


GTG AAGATAT 






ATTATCG 


TTCCATTTGA 


AAAAGAGAAT 


GAAGCGAAAG 


Z TA.CCTTAGA 




J~l ^ 


3CAGACC 


GCTTTACGAA 


ATTAAAAAAT 


AATCC CAA TG 


C^aGG/vA * TGA 


7GTCATTGAA 


T . rt 


GCACAAG 


CAAATGCAGC 


AGAAGGTGGA 


AAAGATGGGT 


TATTTGAAAA 


AATTAC AGmA 


AA_ri 


3AAGTAC 


CTAATTTAAG 


TCAGTTAACG 


CCGGGAGCAA 


AAGA.GGTTTT 


T 3AAAGTGGT 




G3C 37AC 


:aatcgctgt 


AAACAGTATC 


GGGATTGTTT 


AGAACAAAGA 


AAAATTAGGC 


AAAGAAATTA 


AAAACTGGGA 


TGACTTATGG 


TCAGCT3ATT 


TG AAA G G T AA 


AATTTCT3TT 


CCA 


3ACGTTG 


CCACGACGGC 


AGGTCCTTTA 


ATGTTATACG 


TTGCTAGTGA 


a:atgctggt 


CAA 


3ATATTA 


CAAAAGATAA 


C3GGAAGGCC 


GCTTTTGAAG 


CGATGAAAGA 


ATTAAAAC "A 


AAC 


3TTGTTA 


AAACGTATTC 


AAAATCGTCA 


GACTTAGCNA 


ATATGTTCCA 


ATCTG 3TGAA 


ATT 


3mA jC AG 


CTGTGGTTGC 


TGATTTTGCG 


GTTGATATTA 


TTCAAGGCGC 


ACAGAAAA 











EF0 31-4 (SEQ ID NO: 120) 
TTKSESKG GSKALWSTF 

GLSEDI VKKD I IAPFEKENE AKVTLEVGNS ADRFTKLKNN PNAGIDVIEL AQANAAQGGK 
DGLFEKITEK EVPNLSQLTP GAKEVFESGA GVPIAVKSIG IVYNKEKLGK EIKNWDDLWS 
ADLKGKISVP DVATTAGPLK LYVASEHAGQ DITKDNGKAA FEAMKELKPN WKTYSKSSD 
LANMFQSGEI EAAWADFAV DIIQGAQK 



EF032-1 { SEQ ID NO:121) 

TGAATAAATT ATTTAGGAGG AATTATGATG 

GTTTGTGGTA TTTCACTACT TACTGCTTGT 

AAGTCAACCA GTCAATCTAG CAGCACAGTT 

TCAGGGGAAT ATTCAGTTGG AAAAGATATT 

CAACTAGATG ATAAATCGAG CATAGTTCTT 

AACCATGACT TATACGGAGT GGGAAACAAG 

CTCACATTCG AAACTGCCGA CAAAGATTTT 

CAAGAATATA TGAAAAATCC AGTATCNAGT 

TCTGATGTTT CTAAAAGTAG TAGCCAAGAT 

GAAGTAAGTA CTGAAGCGAA GTCTGATGTA 

AATAC TAATG ACATTACTAA GCTAGCAGAT 

GATACTTTAG CTAAGCATCA ATTTAATGAT 

TCAATTATCG GCGTCATCCC AACCATGGAC 



AAAAAATTAA TTAGTTTAGG ATTGGTTTGT 
NCGGGAAATA ATGATAATAA AGATACTGAA 
AAACAACCGA ATTCAAAAGA CTTTGTTGCG 
GATCCTGGAG ATTACTATGC TGTATTAACT 
ATTACCGTCA AATCAGGCGG AGAAAATAGT 
AAAAAAGTAT CTCTTAAAAA GGGAGATACT 
GTTGTTAGAT TTTTAAATGA AAAAGATTTT 
ACTGAAACTA GCAAACANAA AACAGTAAAC 
AATAAACAAT CTGATGTATC TGAAAAAAAA 
GCTACTAATA CTTTACCGAG CGAAGATAAA 
GAGCCAACCT TAGAACAACA AACCGTCTTA 
ATGTATCCTT ATAAAGGAAG CAAAATGCAT 
GCAAAAAGAT GGTAA 



EF032-2 ( SEQ ID NO:122) 

MK KLISLGL.VCV CGISLLTACX GNNDNKDTEK STSQSSSTVK QPNSKDFVAS 
GEYSVGKDID PGDYYAVLTQ LDDKSSIVLI TVKSGGENSN HDLYGVGNKK KVSLKKGDTL 
TFETADKDFV VRFLNEKDFQ EYMKNPVSST ETSKXKTVNS DVSKSSSQDN KQSDVSEKKE 
VSTEAKSDVA TNTLPSEDKN TNDITKLADE PTLEQQTVLD TLAKHQFNDM YPYKGSKMHS 
IIGVIPTMDA KRW 



EF032-3 (SEQ ID NO:123) 
TA ATGATAATAA AGATACTGAA 

AAGTCAACCA GTCAATCTAG CAGCACAGTT AAACAACCGA ATTCAAAAGA CTTTGTTGCG 
TCAGGGGAAT ATTCAGTTGG AAAAGATATT GATCCTGGAG ATTACTATGC TGTATTAACT 



WO 98/50554 



PCT/US98/08959 



113 

TABLE 1. Nucleotide and Amino Acid Seqeuences of E. faecalis Genes. 

CAACTAGATG ATAAATCGAG CATAGTTCTT ATTACCGTCA AATCAGGCGG AGAAAATAGT 
AACCATGACT TATACGGAGT GGGAAACAAG AAAAAAGTAT CTCTTAAAAA GGGAGATACT 
CTCACATTCG AAACTGCCGA CAAAGATTTT GTTGTTAGAT TTTTAAATGA AAAAGATTTT 
CAAGAATATA TGAAAAATCC AGTATCNAGT ACTGAAACTA GCAAACANAA AACAGTAAAC 
TCTGATGTTT CTAAAAGTAG TAGCCAAGAT AATAAACAAT CTGATGTATC TGAAAAAAAA 
GAAGTAAGTA CTGAAGCGAA GTCTGATGTA GCTACTAATA CTTTACCGAG CGAAGATAAA 
AATACTAATG ACATTACTAA GCTAGCAGAT GAGCCAACCT TAGAACAACA AACCGTCTTA 
GATACTTTAG CTAAGCATCA ATTTAATGAT ATGTATCCTT ATAAAGGAAG CAAAATGCAT 
TCAATTATCG GCGTCATCCC AACCATGGAC GCAAAAAGAT GG 



EF032-4 (SEQ ID NO:124) 

NDNKDTEK STSQSSSTVK QPNSKDFVAS 
GEYSVGKDID PGDYYAVLTQ LDDKSSIVLI 
TFETADKDFV VRFLNEKDFQ EYMKNPVSST 
VSTEAKSDVA TNTLPSEDKN TNDITKLADE 
IIGVIPTMDA KRW 



TVKSGGENSN HDLYGVGNKK KVSLKKGDTL 
ETSKXKTVNS DVSKSSSQDN KQSDVSEKKE 
PTLEQQTVLD TLAKHQFNDM YPYKGSKMHS 



EF033-1 (SEQ ID NO: 125) 

TGACTGCTTT TTTTCTATTG GAGAAAAAAG 
C AAAGG AGGT TCATTTCAGA AAATTTTCCC 
AAAATGAAAA AATTTACTTT AACAATGATG 
GCAGGATGTG GTAAACAGGA AAAGAAAGCA 
TTACCAACCA AAGACCGTAG CGGCAAAGAA 
ATTTCCCTAG TGCCATCAAC AACAGAAGTG 
ATCGCAGTTG ATACTCAAAG TAGTACAATG 
GATATGATGG CTGTCGATGC CGAAAAATTG 
AATGACATCA ATTTAGCTAG CTCAGAAAGT 
ACAGTCGTTA ATATCCCCAC TAGTACAAGC 
ATCGCTGATA GCTTATCTGA ACATGAAAAA 
GAAATCGACG AG TAG 



TGGTTTTTTT GTATTGTTTT GACGTTGAGA 
CAAAATAAAA TAG A C G AATG CGAGGATGAA 
ACTTTAGGTT TAGTAGCAAC ACTTGGCTTA 
ACTACCTCTT CTGAAAAAAC AGAAGTAACG 
ATTACTTTAC CCAAAGAAGC AACCAAAATT 
ATTGAAGACT TAGGTAAAAC CGACCAATTA 
ATGACTGATT TAAAAAAATT ACCACAAATG 
ATTGCCTTGA AACCACAAAT TGTTTATGTG 
GTTTGGAAGC AAGTGGAAGA TGCTGGAATT 
ATCAAAGCAA TCAAAGAAGA CGTCCAATTC 
GGACAAAAGT TAATCAAAAC AATGGATCAA 



EF033-2 (SEQ ID NO:126) 
MKKFTLTMMT LGLVATLGLA 

GCGKQEKKAT TSSEKTEVTL PTKDRSGKEI TLPKEATKII SLVPSTTEVI EDLGKTDQLI 
AVDTQSSTMM TDLKKLPQMD MMAVDAEKL I ALKPQIVYVN DINLASSESV WKQVEDAGIT 
WNIPTSTSI KAIKEDVQFI ADSLSEHEKG QKLIKTMDQE IDE 

EF033-3 (SEQ ID NO:127) 

CTCTT CTGAAAAAAC AGAAGTAACG 

TTACCAACCA AAGACCGTAG CGGCAAAGAA ATTACTTTAC CCAAAGAAGC AACCAAAATT 
ATTTCCCTAG TGCCATCAAC AACAGAAGTG ATTGAAGACT TAGGTAAAAC CGACCAATTA 
ATCGCAGTTG ATACTCAAAG TAGTACAATG ATGACTGATT TAAAAAAATT ACCACAAATG 
GATATGATGG CTGTCGATGC CGAAAAATTG ATTGCCTTGA AACCACAAAT TGTTTATGTG 
AATGACATCA ATTTAGCTAG CTCAGAAAGT GTTTGGAAGC AAGTGGAAGA TGCTGGAATT 
ACAGTCGTTA ATATCCCCAC TAGTACAAGC ATCAAAGCAA TCAAAGAAGA CGTCCAATTC 
ATCGCTGATA GCTTATCTGA ACATGAAAAA GGACAAAAGT TAATCAAAAC AATGGATCAA 
GAAATCGACG AGTAG 
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EF 3 3 2-4 (SEQ ID NC : 126) 

SSEKTEYTL PTKDRSGKEI TLPKEATKII SLYPSTTEYI EDLGKTDQLI 

AVDTQSSTXK TDLKKLP^D MMAYDAEKLI ALXPQIYYVN DINLASSEEV WKQVEDAGIT 

WNIPTSTSI KAIKEDVQFI ADSLSEHEKG QKLIKTMDQE TCP 

EFC34-1 { £EQ ID ICC : 12 9 ) 

TAGGAGGGAG TAA7CATGAA AAAAATCGGG TATTTTAGTT GTATTATTTT TTTCATGTTT 

TTGGTAGGTT GTAGTAATAA CAAAAAAGAA AACGGCAA7C TTTTGAA TGC CAGTTCGTTT 

CCTTTAATAC TCACCACGAT TATTGAAAAA GAAGAAGACC TAACGAAAGG TTCAATTTTT 

TTCAACAAGG ATAAAACCAT G AC GCTTG AA AAAGAATATT TAGTTAATCC CAATAATGAA 

GACACAAAAA AAACAAGTAG AACAGAAAAA AAGGTATATA AAAATATTAA AATACAAGAA 

AATAAAGAGA GCTATGAAAT TATAGGTCAA TTGGACAAAA AAACGAAAAA AATAGAGTTT 

AAAAAAGTTG ATGAAGGTAA ACGTATATCT GATGCAGAAG GTAATGTGTA TGGTGATTTT 
GGTGGTAAAT AG 

EF034-2 (SEQ ID NO:13C) 

MKKIGY FSCI IFFMFL VGC SNNKKEN GNLLNASSFP LILTTI IEKE EDLTKGS IFF 
NKDKTMTLEK EYLVNPNNED TKKTSRTEKK VYKNIKIQEN KESYEI IGQL DKKTKKIEFK 
KVDEGKRISD AEGNVYGDFG GK 

EF034-3 (SEQ ID NO: 131) 

AGAA AACGGCAATC TTTTGAATGC CAGTTCGTTT 

CCTTTAATAC TCACCACGAT TATTGAAAAA GAAGAAGACC TAACGAAAGG TTCAATTTTT 
TTCAACAAGG ATAAAACCAT GACGCT7GAA AAAGAATATT TAGTTAATCC CAATAATGAA 
GACACAAAAA AAACAAGTAG AACAGAAAAA AAGGTATATA AAAATATTAA AATACAAGAA 
AATAAAGAGA GCTATGAAAT TATAGGTCAA TTGGACAAAA AAACGAAAAA AATAGAGTTT 
AAAAAAGTTG ATGAAGGTAA ACGTATATCT GATGCAGAAG GTAATGTGTA TGGTGATTTT 
GGTGGTAAAT AG 

EF034-4 (SEQ ID NO:132) 

KEN GNLLNASSFP LILTTI IEKE EDLTKGS IFF 

NKDKTMTLEK EYLVNPNNED TKKTSRTEKK VYKNIKIQEN KESYEIIGQL DKKTKKIEFK 
KVDEGKRISD AEGNVYGDFG GK 



EF035-1 (SEQ ID NO:133) 

TAAACGAGAG GTGAGTTTAT GAAAACAAAA ATCGGAAAAA CAGTTATCTT GTCAGCATTT 

TTATTCACAA GTTTCCTTTT ACTGAGTGGT TGTACCTCGG CTGGCGAAGA GATGGAAAAA 

ACAATTGATC GACAGAAAGA AAAAGTCGAT AAAACGGTCG ATAAGCAGAA ACATAAAAAT 

GAAAATTCCA TGGAAAGTTA CGACGAAAAA GTTGACCGTT CTTTAGATAG TCAAGAAGAC 
AAAATCGATA CTACTGAGTA A 

EF035-2 ( SEQ ID NO:134) 

MKTKI GKTVILSAFL FTSFLLLSGC TSAGEEMEKT IDRQKEKVDK TVDKQKHKNE 
NSMESYDEKV DRSLDSQEDK IDTTE 
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EF035-3 ( SEQ ID NO:135) 
GATGGAAAAA 

ACAATTGATC GACAGAAAGA AAAAGTCGAT AAAACGGTCG ATAAGCAGAA ACATAAAAAT 
GAAAATTCCA TGGAAAGTTA CGACGAAAAA GTTGACCGTT CTTTAGATAG TCAAGAAGAC 
AAAATCGATA CTACTGAG 

EF035-4 ( SEQ ID NO:136) 



MEKT IDRQKEKVDK TVDKQKHKNE 
NSMESYDEKV DRSLDSQEDK IDTTE 



EF036-1 (SEQ ID NO:137) 

TAATTTTCAA GTCCTACATA TAATGGTAAA ATAGAATGGA TTGAAATTAA TTGGAGGAAT 

AATGAATCGA TGAAAAAAAG ATTGCTATTA TTTATTGGTT TGGCAAGTAT AC TT AC TTTG 

ACAGGATGTG CAAAATGGAT TGATCGTGGT GAATCCATCA CAGCGGTAGG CTCATCAGCT 

TTACAACCAT TAGTAGAGAC AGCGAGTGAG GAATATCAAA GCCAAAATCC GGGAAGATTT 

ATTAATGTCC AAGGTGGCGG AAGCGGAACA GGTCTGAGTC AAGTCCAATC TGGCGCGGTA 

GACATTGGTA ATTCTGATTT ATTTGCAGAA GAGAAAAAGG GCATCAAAGC GGAAGACTTA 

ATTGATCATA AAGTTGCTGT CGTTGGGATT ACACCAATCG TTAACAAAAA TGTCGGTGTC 

AAAGATATCT CAATGGAAAA TTTAAAGAAA ATCTTTTTAG GTGAAGTAAC AAAC TG G AAA 

GAACTTGGCG GGAAAGACCA AAAAATTGTT ATTTTGAATA GAGCGGCCGG TAGTGGTACG 

CGTGCGACTT TTGAAAAGTG GGTCTTGGGA GATAAAACAG CCATTCGTGC GCAAGAACAA 

GATTCCAGCG GCATGGTTCG TTCCATTGTT TCTGATACAC CAGGAGCGAT TAGTTATACC 

GCATTTTCAT ATGTTACTGA TGAAGTAGCT ACGTTAAGTA TTGATGGTGT TCAGCCAACA 
GATGAAAATG TAATGAACAA TAAATGGATT ATTTGGTCTT ATGAACACAT GTACACTCGT 

AAAAATCCAA GTGATTTAAC CAAAGAGTTT TTAGACTTTA TGTTGTCAGA TGATATCCAA 
GAACGTGTGA TTGGTCAATT AGGGTATATT CCTGTTTCGA AAATGGAAAT TGAACGGGAT 
TGGCAAGGAA ATGTCATTAA ATAA 

EF-36-2 (SEQ ID NO:138) 

MKKRLLLF IGLASILTLT GCAKWIDRGE SITAVGSSAL 

QPLVETASEE YQSQNPGRFI NVQGGGSGTG LSQVQSGAVD IGNSDLFAEE KKGIKAEDLI 
DHKVAWGIT PIVNKISTVGVK DISMENLKKI FLGEVTNWKE LGGKDQKIVI LNRAAGSGTR 
ATFEKWLGD KTAIRAQEQD SSGMVRSIVS DTPGAISYTA FSYVTDEVAT LSIDGVQPTD 
ENVMNNKWII WSYEHMYTRK NPSDLTKEFL DFMLSDDIQE RVIGQLGYIP VSKMEIERDW 
QGNVIK 

EF036-3 (SEQ ID NO:139) 

GAT TGATCGTGGT GAATCCATCA CAGCGGTAGG CTCATCAGCT 

TTACAACCAT TAGTAGAGAC AGCGAGTGAG GAATATCAAA GCCAAAATCC GGGAAGATTT 
ATTAATGTCC AAGGTGGCGG AAGCGGAACA GGTCTGAGTC AAGTCCAATC TGGCGCGGTA 
GACATTGGTA ATTCTGATTT ATTTGCAGAA GAGAAAAAGG GCATCAAAGC GGAAGACTTA 
ATTGATCATA AAGTTGCTGT CGTTGGGATT ACACCAATCG TTAACAAAAA TGTCGGTGTC 
AAAGATATCT CAATGGAAAA TTTAAAGAAA ATCTTTTTAG GTGAAGTAAC AAACTGGAAA 
GAACTTGGCG GGAAAGACCA AAAAATTGTT ATTTTGAATA GAGCGGCCGG TAGTGGTACG 
CGTGCGACTT TTGAAAAGTG GGTCTTGGGA GATAAAACAG CCATTCGTGC GCAAGAACAA 
GATTCCAGCG GCATGGTTCG TTCCATTGTT TCTGATACAC CAGGAGCGAT TAGTTATACC 
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GCATTTTCAT ATGTTACTGA TGAAGTAGCT ACGTTAAGTA TTGATGGTGT TCAGCCAACA 
GATGAAAATG TAATGAACAA TAAATGGATT ATTTGGTCTT ATGAACACAT SaCACTCGt" 
AAAAATCCAA GTGATTTAAC CAAAGAGTTT TTAGACTTTA TGTTGTCAgI SSSS 
GAACGTGTGA TTGGTCAATT AGGGTATATT CCTGTTTCGA AAATGGAAAT TGAACGGGAT 
TGGCAAGGAA ATGTCATTAA A ^aal^ai 

EF036-4 (SEQ ID NO:140) 

IDRGE SITAVGSSAL 

QPLVETASEE YQSQNPGRFI NVQGGGSGTG LSQVQSGAVD IGNSDLFAEE KKGIKAEDLI 
DHKVAWGIT PIVNKNVGVK DISMENLKKI FLGEVTNWKE LGGKDQKIVI SSSSSS 
^ WLGD KTA1 ^ D SSGMVRSIVS DTPGAISYTA FSYVTDEVAT L^qSd 
ENVMNNK WII WSYEHMYTRK NPSDLTKEFL DFMLSDDIQE RVIGQLGYIP 



EF037-1 ( SEQ ID NO:141) 

tcc ™aa tcagttatga taaaggaaga aataaataaa 

TTTTTTGGAG GCATTTTCAT GAAAATGTCT AAAGTACTCA CCACTGTTTT GACGGCAACT 

gctgctcttg tgttgcttag tgcttgttca tctgataaaa aaacagatag SgScSS 

AGCAAAGAAA CAGCTAATTC AAGTACAGAA GTAGTCTCTG GTGCTTCaS SgSccS 
CCTGAAGAGC TCGAAATGGC GTTAAGTGAT AAAGGAAATT GGATTGTCGC IgcSc^C 
AATGTCACTT TTGATAAAGA GGTAACAGTT GCTGGTACTT TCCATGATAA GGGGaSgAT 
TCCAACGATG TCTATCGTAA ATTAGCACTT TATTCCCAAG ATGATAATAA SSSi^J 
GCTGAATATG AAATCACGGT TCCTAAGCTA atcgtttctt ctgaaaattt ct^ctS 
CACGGGACTG TCAAAGGTGA TATTGAGGTG AAAGCAAATG GCTTTAcS A^SSaCC 
AAAGTTAATG GCAATATTAC TTTTGATAAA CAAGAATACA AAGATTCTGC 
AAAGATGGTG CCACTGTTAC TGGTGAAGTC ACCGTAGCCA ATAA ^llAGAA 

EF037-2 (SEQ ID NO: 142) 

MKMSK VLTTVLTATA ALVLLSACSS DKKTDSSSSS 

KETANSSTEV VSGASISAKP EELEMALSDK GNWIVAATDN VTFDKEVTVA GTFHDKGKDS 
NDVYRKLALY SQDDNKKVTA EYEITVPKLI VSSENFNIVH GTVKGDIEVK SSftSS 
VNGN ITFDKQ EYKDSADLEK DGATVTGEVT VANN J- VKGDIEVK ANGFTLNGTK 

EF037-3 (SEQ ID NO: 143) 

AACAGATAG TAGTTCTAGT 

AGCAAAGAAA CAGCTAATTC AAGTACAGAA GTAGTCTCTG GTGCTTCAAT TAGTGCCAAC 
AA^ AGC TCGAAATCGC ^AAGTGAT AAAGGAAATT GGaSSSc ISacSa"c 
iJSSS TTGATAAAGA GGTAACAGTT GCTGGTACTT TC CATGATAA GGGGAAAGAT 
TCCAACGATG TCTATCGTAA ATTAGCACTT TATTCCCAAG ATGATAATAA AAAAGTAACT 
GCTGAATATG AAATCACGGT TCCTAAGCTA ATCGTTTCTT CTGAAAATTt" JiJSSJSS 
CACGGGACTG TCAAAGGTGA TATTGAGGTG AAAGCAAATG GCTTTACTTT A^SSS 
AAAGTTAATG GCAATATTAC TTTTGATAAA CAAGAATACA AAGATTCTGC T^CtSgS 
AAAGATGGTG CCACTGTTAC TGGTGAAGTC ACCGTAGCCA A JaSACTTAGAA 

EF037-4 (SEQ ID NO: 144) 

TDSSSSS 

KETANSSTEV VSGASISAKP EELEMALSDK GNWIVAATDN VTFDKEVTVA GTFHDKrxnq 
NDVYRKLALY SQDDNKKVTA EYEITVPKLI VSSENFNIVH GTvSSSS SgFTLNgS 
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EF036-; ( SEQ ID NO: 145) 

l^T:;tl ;; CGTC - AC - aataaagags atgaagctac tcaaatggcg ttggcaatcg 

mAC/^ooATC ATaAAAAAGG agaagtgagc ATGAAAAA,\G TACTACCTTT TATTP C PTT A 
^TGTTGTC AGGTT3TGGA ACAGATATGA AAAAGATATT GACTGCCGAT 
>^>GTGvjTAA>iT GGGAACTAGA AAATAAAA.GT CCAA.CTACTA CTTACACTTT r T T TGA' T GAT 

A^TAAaI 'r^ GTATAA ACTGATAGTG GAACGTACTC TtSatSI 

«A -AATAAAA AACTCACTTT GGATATAAAA AATAAAGAAC AATTAATAAT GGAAAATGTT 
o*iAT*TAAAG ACGGTAAATT AAAAGGTGAA ATTGGAGGCG AGAAGGACTC "GATAAAAAA 
THGAATAAGA GGTGTCTTTG A -WiiAAAAAA 

EF038-2 (SEC ID NC:146) 

M KLLKWRWQWN KDHKEGEVSM KKVLPFIALV GLLLLSGCG- DMKK T LTADG 

TTTYTFFDDE TFSRYKSKIS DSGTYSYDEN NKKLTLDIKN KEQLIMENVE 
YKDGKLKGEI GGEKDSDKKX NKRCL 

EFC38-3 (SEQ ID NO: 147) 

TTGTGGA ACAGATATGA AAAAGATATT GACTGCCGAT 

GGTGGTAAAT GGGAACTAGA AAATAAAAGT CCAACTACTA CTTACACTTT TTTTGATGAT 
GAAACTTTTT CGAGGTATAA TTCAAAAATT AGTGATAGTG GAACGTACTC TTACGATGAA 
^l^Z^ AACTCACTTT GGATATAAAA AATAAAGAAC AATTAATAAT GGAAAATGTT 
ACGGTAAATT AAAAGGTGAA ATTGGAGGCG AGAAGGACTC TGATAAAAAA 

TNGAATAAGA GGTGTCTTTG A 

EF03 8-4 ( SEQ ID NO: 148} 
CGT DMKKILTADG 

GKWELENKSP TTTYTFFDDE TFSRYNSKIS DSGTYSYDEN NKKLTLDIKN KEQLIMENVE 
YKDGKLKGEI GGEKDSDKKX NKRCL 



EF039-1 (SEQ ID NO:149) 

TAAATATATC AAAAAGAAAA AAGGGGATTA 
GCGCTTACCT TATTAACCTT TAGTACGTTG 
TCTGCAACAG ATAAATCAAG TGCAGCTAGC 
GCAGCTAAAG AGCAATCAAA AGGACAAGAA 
CAAGGCACAA AAGTTTACGA CAAAAATNAT 
ATTGGTTTAG CAAAATATGA TGGTGAAACA 
GGTGAAACCC GTGGCGATGA AGGCACATTC 
TTAATTTCGG ATACACAAAA CTATCAAGCG 
AAATTTACCT ATAAGCGAAT GGGTAAAGAT 
GAACATATCC CTTATTCTGA C GAG AAA TT A 
ACAGAAACTG GCAAGATTGT TACCAATGAA 
TGGAATGGCA CGAAAGTTTT AGATGAAGAC 
TTTATTAGTT TAGCGAAATT TGATAATAAA 
ACGGGTAAAA CACGTGGAGA TTTTGGTTAC 
GCTCACGTTT CAATTGGTGA CAATAAATAT 
GATAAACGTT TTACGTATAC ACGAATGGGT 



CCAACCATGA AAAAGAAAAA AGTTTTTAGT 
TTGATTGCAG GCTGTGCTGG CGGAGCCAAC 
TCAAGCACTG CAGTCTCTAG TTCAGCAGAA 
TTAACAGAAA TTTTATCCAG TACTGATTGG 
AATAATTTAA CAGCAGAAAA TGCTAATTTT 
GGTTTTTATG AATTTTTCGA CAAAGAAACA 
TTTGTGACAG ACGATGGCGA AAAGCGTATC 
GTGGTCGATT TAACGGAAGT GACGAAAGAT 
AAAGACGGGA AAGATGTAGA AGTCTTTGTA 
ACCTTTACGA ACGGCCGTAA AGATTTAGAA 
CCTGGGGATG ACATTTTAGG GGCCACATTA 
GGTAACGATG TTACTGAAGC AAATAAAATG 
ACAAGTAAAT ATGAATTCTT TGATTTAGAA 
TTCCAAGTAA TTGATAATAA CAAAATCCGT 
GGAGCTGCAT TAGAATTAAC AGAATTAAAT 
AAAGACAACA ATGGCAAAGA AATTAAAGTC 
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enes. 



TTTGTAGAAC ATGAACCATA TGAAGGAGAC TTTACGCCAG ACTTCACGTT CTAA 
EF039-2 (SEQ ID NO:150) 

MKKKKVFSA LTLLTFSTLL IAGCAGGANS ATDKSSAASS STAVSSSAEA 
AKEQSKGQEL TEILSSTDWQ GTKVYDKNXN NLTAENANF I GLAKYDGETG FYEFFDKETG 
ETRGDEGTFF VTDDGEKRIL ISDTQNYQAV VDLTEVTKDK FTYKRMGKDK DGKDVEVFVE 
HIPYSDEKLT FTNGRKDLET ETGKIVTNEP GDDILGATLW NGTKVLDEDG NDVTEANKMF 
ISLAKFDNKT EKYEFFDLET GKTRGDFGYF QVIDNNKIRA HVSIGDNKYG AALELTELND 
KRFTYTRMGK DNNGKEIKVF VEHEPYEGDF TPDFTF 

EF039-3 (SEQ ID NO: 151) 

TGCAACAG ATAAATCAAG TGCAGCTAGC TCAAGCACTG CAGTCTCTAG TTCAGCAGAA 
GCAGCTAAAG AGCAATCAAA AGGACAAGAA TTAACAGAAA TTTTATC C AG TACTGATTGG 
CAAGGCACAA AAGTTTACGA CAAAAATNAT AATAATTTAA CAGCAGAAAA TGCTAATTTT 
ATTGGTTTAG CAAAATATGA TGGTGAAACA GGTTTTTATG AATTTTTCGA CAAAGAAACA 
GGTGAAACCC GTGGCGATGA AGGCACATTC TTTGTGACAG ACGATGGCGA AAAGCGTATC 
TTAATTTCGG ATACACAAAA CTATCAAGCG GTGGTCGATT TAACGGAAGT GACGAAAGAT 
AAATTTACCT ATAAGCGAAT GGGTAAAGAT AAAGACGGGA AAGATGTAGA AGTCTTTGTA 
GAACATATCC CTTATTCTGA CGAGAAATTA ACCTTTACGA ACGGCCGTAA AGATTTAGAA 
ACAGAAACTG GCAAGATTGT TACCAATGAA CC TGGGGATG ACATTTTAGG GGCCACATTA 
TGGAATGGCA CGAAAGTTTT AGATGAAGAC GGTAACGATG TTACTGAAGC AAATAAAATG 
TTTATTAGTT TAGCGAAATT TGATAATAAA ACAAGTAAAT ATGAATTCTT TGATTTAGAA 
ACGGGTAAAA CACGTGGAGA TTTTGGTTAC TTCCAAGTAA TTGATAATAA CAAAATCCGT 
GCTCACGTTT CAATTGGTGA CAATAAATAT GGAGCTGCAT TAGAATTAAC AGAATTAAAT 
GATAAACGTT TTACGTATAC ACGAATGGGT AAAGACAACA ATGG C AAAGA AATTAAAGTC 
TTTGTAGAAC ATGAACCATA TGAAGGAGAC TTTACGCCAG ACTTCACGTT CTAA 

EF039-4 (SEQ ID NO: 152) 

ATDKSSAASS STAVSSSAEA 

AKEQSKGQEL TEILSSTDWQ GTKVYDKNXN NLTAENANF I GLAKYDGETG FYEFFDKETG 

ETRGDEGTFF VTDDGEKRIL ISDTQNYQAV VDLTEVTKDK FTYKRMGKDK DGKDVEVFVE 

HIPYSDEKLT FTNGRKDLET ETGKIVTNEP GDDILGATLW NGTKVLDEDG NDVTEANKMF 

ISLAKFDNKT SKYEFFDLET GKTRGDFGYF QVIDNNKIRA HVSIGDNKYG AALELTELND 
KRFTYTRMGK DNNGKEIKVF VEHEPYEGDF TPDFTF 

EF040-1 (SEQ ID NO: 153) 

TAGATTAGAA CCACTGGAGA AAAATCTCAT ATTTCTCTCG AGGAAAGGAA GTTGAGCACA 
ATGAACAAAA AAATTTTAAT GGGGCTATTA AGTGTCGTGA CCATTCCATT ACTTGCTGCG 
TGTCAAGGAG GAGAAACACC TTCCGCAGCG TCAAAAAATA GTCAAACGGT GACTACTCAA 
AGTAGTGCAA AAACTGAAAG CACCAGTACA ACCCGTTCGG TAGCTCAAAC AACATCAAAA 
GAGGAAGTGA AAGAACCGAT GAAGACCTAT GAAGTGGGTG CGCTTTTAGA AGCAGCCAAT 
CAACGAGATA CGAAGAAGGT CAAGGAAATT TTACAAGATA CTACTTATCA AGTGGATGAA 
GTCGACACAG AAGGCAACAC ACCGCTCAAT ATCGCTGTTC ACAATAATGA CATTGAGATT 
GCAAAAGCGT TGATTGATCG GGGTGCCGAT ATTAATCTGC AAAACAGCAT TAGTGATAGT 
CCCTATCTTT ATGCGGGAGC GCAAGGACGT ACGGAGATTT TAGCGTATAT GTTAAAACAT 
GCGACCCCAG ATTTAAATAA GCATAACCGT TACGGTGGCA ATGCGTTAAT TCCGGCAGCT 
GAAAAAGGAC ATATTGACAA TGTGAAGCTC TTGTTAGAAG ATGGACGAGA AGACATAGAT 
TTCCAAAATG ACTTTGGCTA TACAGCATTG ATTGAGGCAG TGGGGTTACG TGAAGGGAAC 
CAACTTTACC AAGATATTGT AAAATTGTTA ATGGAAAATG GTGCGGATCA ATCCATTAAA 
GACAATTCTG GTCGAACAGC AATGGACTAT GCCAATCAAA AAGGTTATAC GGAAATTAGT 
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EFC4C-2 (SEQ ID XC:154} 

M HKKILMGLLS WTIP1LAAC QGGETPSAAS KN^-T-X?* 

'ScS;S ;?H^f EVKEPMKTVE VGAoLEAANQ ROTKK^E!, QOTTYQVDEV 

-~ SSSSS ™S 5™ 1 — - 

EFC4 0-3 ( SEQ ID NC:155) 

AGCG tcaaaaaata GTCAAACGGT GACTACTCAA 

EE ™ ™ — s = 

SSS ATCGC^GTTC *™ 

S^SS ST" SSSSS 

™^ ss; ssi ss; ™s 
ssssss s ™™ M0 

-™ "~ — s SSS Sc~ 

sss sss r™ GccaaTCAM ssss 

EF040-4 (SEQ ID NO.-156) 
AS KKSQTVTTQS 

SAKTESTSTT RSVAQTTSKE EVKEPMKTYE v^r.r.r^ -.„_ 

EF041-1 (SEQ ID NO:157) 

SSSS ESS ES3S KSSS JESS EE? 

1= S= i= E= == =i 

ttagacaaag ataacaaS cSISgca S™gg SIJS^ 

=S XSSSI SSIS sss ,»~= =ss 

GAC A TTGTGG AAAAATATGG TAAAAATTAT rra^n**™ CATTCTTCCC GCAACGTCAA 
GGTCCATTCG TCTTAG A CGG C^^rrl rr~t GCGAAAGTGC TGTCTACAAT 

= sss ™ ™ ™ 1™ 

S55S = — = ™ i= 
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TABLE 1. Nucleotide and Amino Acid Scqeuences of E. faecalis Genes. 

AACTTACGTA AAGCAATTTC TTACTCAATC GACCGTAAAG CGTTAGTTGA ATCAATCCTT 
AGGGGATGG 



EF041-2 (SEQ ID NO: 158) 

M KLKKSLTFGV ITLFSVTTLA ACGGGGTSDS SSASGGGKAS 

GEQVLRVTEQ QEMPTADLSL ATXRISFIAL NNVYEGIYRL DKDNKVQPAG AAEKAEVSED 

GLTYKIKLNK DAKWSDGKPV TANDYVYGWQ RTVDPATASE YAYLYASVKN GDAIAKGEKD 

KSELGIKAVS DTELEITLEK ATPYFDYLLA FPSFFPQRQD IVEKYGKNYA SNSESAVYNG 

PFVLDGFDGP GTDTKWSFKK NDQYWDKDTV KLDSVDVNW KESPTALNLF QDGQTDDWL 

SGELAQQMAN DPAFVSQKEA STQYMELNQR DEKSPFRNAN LRKAISYSID RKALVESILR 
GW 



EF041-3 (SEQ ID NO: 159) 

TTGTG GAGGCGGCGG AACGTCAGAT AGCTCAAGCG CGTCTGGTGG CGGTAAGGCA 
AGTGGCGAAC AAGTTTTACG TGTCACAGAA CAACAAGAAA TGCCAACAGC TGATTTATCA 
CTAGCAACAG NCAGAATTAG TTTTATTGCA TTAAATAATG TATATGAAGG AATTTATCGT 
TTAGACAAAG ATAACAAAGT CCAACCTGCA GGTGCAGCGG AAAAAGCAGA AGTTTCTGAA 
GATGGACTAA CATACAAAAT TAAATTAAAT AAAGATGCAA AATGGTCAGA CGGTAAACCA 
GTGACTGCTA ATGACTATGT TTACGGATGG CAACGAACAG TTGATCCAGC GACAGCTTCT 
GAATATGCTT ATCTGTATGC CTCTGTAAAA AATGGTGATG CCATTGCTAA AGGGGAAAAA 
GATAAATCAG AATTAGGAAT TAAAGCAGTC AGTGATACAG AATTAGAAAT CACTTTAGAA 
AAAGCAACAC CATACTTTGA TTACTTATTA GCTTTCCCAT CATTCTTCCC GCAACGTCAA 
GACATTGTGG AAAAATATGG TAAAAATTAT GCATCAAACA GCGAAAGTGC TGTCTACAAT 
GGTCCATTCG TCTTAGACGG CTTTGATGGT CCTGGTACAG ATACAAAATG GTCATTCAAG 
AAAAACGATC AATATTGGGA TAAAGATACT GTGAAACTGG ACTCAGTAGA TGTGAATGTC 
GTGAAAGAAT CACCAACCGC GTTGAACTTG TTCCAAGATG GACAAACAGA CGATGTCGTT 
CTTTCTGGTG AATTAGCCCA ACAAATGGCC AATGACCCAG CTTTTGTTAG TCAAAAAGAA 
GCATCAACAC AATATATGGA ACTAAATCAA CGTGATGAAA AATCACCATT TAGAAATGCG 
AACTTACGTA AAGCAATTTC TTACTCAATC GACCGTAAAG CGTTAGTTGA ATCAATCCTT 
AGGGGATGG 

EF041-4 (SEQ ID NO:160) 
CGGGGTSDS SSASGGGKAS 

GEQVLRVTEQ QEMPTADLSL ATXRISFIAL NNVYEGIYRL DKDNKVQPAG AAEKAEVSED 
GLTYKIKLNK DAKWSDGKPV TANDYVYGWQ RTVDPATASE YAYLYASVKN GDAIAKGEKD 
KSELGIKAVS DTELEITLEK ATPYFDYLLA FPSFFPQRQD IVEKYGKNYA SNSESAVYNG 
PFVLDGFDGP GTDTKWSFKK NDQYWDKDTV KLDSVDVNW KESPTALNLF QDGQTDDWL 
SGELAQQMAN DPAFVSQKEA STQYMELNQR DEKSPFRNAN LRKAISYSID RKALVESILR 
GW 

EF044-1 (SEQ ID NO:161) 

TAAGATAAAA TTAGTTATAG CGTCTATAGG AGGAATAGTA TGAAAAAATT AGTTTGTGTT 
ATTTTAGTTA TTTTTTTAAC AGGTTGTAGT TCTCAAAAAG CGAATGAACC TAAAAAACAA 
GAAAATTCTA CCAATCATAC AACATCAATA AAAAGCAGTA CTAATCATTA CAGTTCTAGC 
ATAGAAACAA GCTCTAATAA TAAACTAAAA GAAACTTCAG AAAGTGCCAG CACCACTCAA 
ACTTCGTCAA AGTCGAAAAA TGAAGTATCT ACAAATGTCG AAGAAGCAAA TTCTTTAGAA 
GCAACACCTT ATGCTGTCGA TCTTAGTAGC TTAAACAATC CACTCGTATT TAATTTTAAA 
GGAATGAATG TGCCAACTTC AATTACGTTA GAGAACTTAA ATTCAACACC AACTGCTACC 
TTCCGAACTA AATTGTTTGG GGCTGAAAAT GGTCAAGTGA AAG AAGC CAT TAATAAATAT 
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TABLE i. Nucleotide and Amino Acid Seqeuences of E. laecalis Genes. 

GA3CTATCTA 7AAATACAAT TC GTACAAAA GA3ATTAGAA TATTTTCAGC G3CCGATAAC 

^;l":; G ;" CC3 - TAAA3 - AAATACAGAA TTAATTTTAG GAACTAATAT TTCTTCAAAC 

C-A.^.AAA mTAGATGGGG CAC.T.ATAC TTA7TCAAGA ATAAAAATGG TTCGATA^CT 

™^^: C CTAACTACGr raCAATGT7 ACGGATGATC AAAAAGACGT TATGCTAGAA 

EF044-2 !SEQ I- NO : 1 6 2 ) 

KKKLVCVI LVIFLTGCSS QKANEPKKQE NSTKHTTSIK SSTNHYSSS T 
ETSSNNKLKE TSESASTTQT SEKSKJEVST NYEEANSLEA TPYAVDLSSL NNPLVFNFKG 
KNVPTSITLE NLNSTPTATF RTKLFGAEKG QVXEAIMKYE LS INTIPTKE IPIF^AADNS 
IRTVKVOTEL ILGTNISS.NT EQKRSGTLVL FNNKNGSISL ITPNYAGNVT DDQKDVMLEV 

EF044-3 (SEQ ID NO: 163) 

TTGTAGT TCTCAAAAAG CGAATGAACC TAAAAAACAA 

GAAAATTCTA CCAATCATAC AACATCAATA AAAAGCAGTA CTAATCATTA CAGTTCTAGC 
ATAGAAACAA GCTCTAATAA TAAACTAAAA GAAACTTCAG AAAGTGCCAG CACCACTCAA 
ACTTCGTCAA AGTCGAAAAA TGAAGTATCT ACAAATGTCG AAGAAGCAAA TTCTTTAGAA 
GCAACACCTT ATGCTGTCGA TC TTAG TAG C TTAAACAATC CACTCGTATT TAATTTTAAA 
GGAATGAATG TGCCAACTTC AATTACGTTA GAGAACTTAA ATTCAACACC AACTGCTACC 
TTCCGAACTA AATTG TTTGG GGCTGAAAAT GGTCAAGTGA AAGAAGCCAT TAATAAATAT 
GAGCTATCTA TAAATACAAT TCCTACAAAA GAGATTAGAA TATTTTCAGC GGCCGATAAC 
AGTATTCGCA CCGTTAAAGT AAATACAGAA TTAATTTTAG GAACTAATAT TTCTTCAAAC 
GATGAACAAA ATAGATCGGG CACTTTATAC TTATTCAACA ATAAAAATGG ^TCGATATCT 
TTAATCACTC CTAACTACGC TGGCAATGTT ACGGATGATC AAAAAGACGT TATGCTAGAA 



EF044-4 (SEQ ID NO:164) 

CSS QKANEPKKQE NSTNHTTSIK SSTNKYSSSI 

ETSSNNKLKE TSESASTTQT SSKSKNEVST MVEEANSLEA TPYAVDLSSL NNPLVFNFKG 
MNVPTSITLE NLNSTPTATF RTKLFGAENG QVKEAINKYE LSINTIPTKE IRIFSAADNS 
IRTVKVNTEL ILGTNISSND EQNRSGTLYL FNNKNGSISL ITPNYAGNVT DDQKDVMLEV 

EF045-1 (SEQ ID NO:165) 

TAGCCAAAAA ATGAGGGAGG AAAAGAGATG AACAAGAAAC GGATTTTAGG TGCAATCACG 
TTAGCTTCTG TGTTAGTATT CGGGTTAGCT GCATGTGGTG GCGGCAATAA AGGCGGGgS 
AATAAAGCAA CGGAAACAGA AGACATTTCA AAAATGCCAA TCGCTGTTAA AAATGATAAA 
AAAGCAATTG ATGGCGGTAC ATTAGATGTC GCTGTAGTTA TGGATACACA ATTCCAAGGA 
CTTTTCCAGC AAGAATTTTA TCAAGACAAC TATGATGCAC AATACATGCT TCCAACGGTA 
CAGCCATTAT TTAACAATGA TGCAGACTTT AAGATTGTCG ATGGGGGTCC TGCGGATCTG 
AAATTAGATG AAGATGCCAA TACAGCAACC ATTAAATTAC GTGACAATTT GAAATGGTCT 
GACGGTAAAG ATGTGACAGC CGATGACGTG ATTTTCTCTT ATGAAGTCAT TGGTCATAAA 
GACTATACAG GGATTCGTTA TGATGATAAC TTTACGAATA TTGTTGGCAT GGAAGACTAC 
™™ AATCGCCAAC CATTTCTGGC ATAGAAAAAG TCAATGATAA AGAAGTTAAA 
ATCACTTATA AAGAAGTTCA CCCAGGAATG CAACAATTAG GTGGCGGTGT TTGGGGCTCA 
GTTTTACCAA AACATGCCTT TGAAGGAATT GCTGTTAAAG ACATGGAATC AAGCGATGCA 
GTTCGTAAAA ACCCTGTGAC TATTGGACCA TACTACATGA GTAATATTGT GACAGGTGAA 
TCTGTTGAAT ACCTACCAAA TGAGCATTAC TACGGTGGTA AACCTAAATT AGaStI 
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GTGTTCAAAT CTGTTCCTTC TGCGAGCATT GTAGAAGCGA TGAAAGCGAA ACAATACGAT 
ATTGCATTAT CAATGCCAAC AGATACGTAT CCAACATACA AAGATACTGA AGGGTATCAA 
ATCTTAGGAC GTCCCGAACA AGCCTACACG TATATTGGCT TTAAAATGGG TACGTTTGAC 
AAAGAAACAA ATACAGTGAA ATACAATCCA AAAGCTAAAA TGGCAGATAA AAGCTTACGT 
CAAGCCATGG GCTATGCAAT TGACAATGAT GCAGTCGGCC AAAAATTCTA CAACGGCTTA 
CGAACAGGGG CAACAACGTT AATCCCACCA GTCTTCAAGA GCTTGCATGA TAG CG AAGCG 
AAAGGCTATA CGCTTGATTT AGACAAAGCG AAAAAATTAT TAGACGATGC TGGTTATAAA 
GACGTAGACG GCGATGGCAT TCGCGAAGAC AAAGAAGGCA AACCACTAGA AATCAAGTTT 
GCTTCAATGT CAGGCGGCGA AACTGCACAA CCACTTGCTG ATTACTATGT CCAACAATGG 
AAAGAAATTG GCTTAAACGT AACGTATACA ACAGGACGCT TAATTGATTT CCAAGCATTC 
TATGATAAAT TGAAAAATGA TGACCCAGAA GTAGATATCT ATCAAGGCGC GTGGGGCACA 
GGTTCAGATC CTTCACCAAC CGGCTTATAT GGTCCAAACT CAGCCTTTAA CTATACACGT 
TTTG AG TC AG AAGAAAATAC TAAATTACTT GATGCGATTC ATTCAAAAGC ATCATTTGAT 
GAAGAAAAAC GTAAAAAAGC CTTCTACGAT TGGCAAGAGT ATGCCATTGA TGAAGCGTTT 
GTAATCCCAA CGCTTTACAG AAATGAAGTC TTGCCTGTCA ACGACCGTGT AGTTGACTTT 
ACTTGGGCAG TTGATACGAA AGATAATCCA TGGGCAACGG TGGGTGTCAC AGCAGACTCA 
CGGAAATAA 



EF045-2 (SEQ ID NO:166) 

MN KKRILGAITL ASVLVFGLAA CGGGNKGGGN KATETEDISK MPIAVKNDKK 
AIDGGTLDVA WMDTQFQGL FQQEFYQDNY DAQYMLPTVQ PLFNNDADFK IVDGGPADLK 
LDEDANTATI KLRDNLKWSD GKDVTADDVI FSYEVIGHKD YTGIRYDDNF TNIVGMEDYH 
DGKSPTISGI EKVNDKEVKI TYKEVHPGMQ QLGGGVWGSV LPKHAFEGIA VKDMESSDAV 
RKNPVTIGPY YMSNIVTGES VEYLPNEHYY GGKPKLDKLV FKSVPSASIV EAMKAKQYDI 
ALSMPTDTYP TYKDTEGYQI LGRPEQAYTY IGFKMGTFDK ETNTVKYNPK AKMADKSLRQ 
AMGYAIDNDA VGQKFYNGLR TGATTLIPPV FKSLHDSEAK GYTLDLDKAK KLLDDAGYKD 
VDGDGIREDK EGKPLEIKFA SMSGGETAQP LADYYVQQWK EIGLNVTYTT GRLIDFQAFY 
DKLKNDDPEV DIYQGAWGTG SDPSPTGLYG PNSAFNYTRF ESEENTKLLD AIDSKASFDE 
EKRKKAFYDW QEYAIDEAFV IPTLYRNEVL PVNDRWDFT WAVDTKDNPW A TVG VTADSR 



EF045-3 (SEQ ID NO:167) 
ATGTGGTG GCGGCAATAA AGGCGGGGGC 

AATAAAGCAA CGGAAACAGA AGACATTTCA AAAATGCCAA TCGCTGTTAA AAATGATAAA 
AAAGCAATTG ATGGCGGTAC ATTAGATGTC GCTGTAGTTA TGGATACACA ATTCCAAGGA 
CTTTTCCAGC AAGAATTTTA TCAAGACAAC TATGATGCAC AATACATGCT TCCAACGGTA 
CAGCCATTAT TTAACAATGA TGCAGACTTT AAGATTGTCG ATGGGGGTCC TGCGGATCTG 
AAATTAGATG AAGATGCCAA TACAGCAACC ATTAAATTAC GTGACAATTT GAAATGGTCT 
GACGGTAAAG ATGTGACAGC CGATGACGTG ATTTTCTCTT ATGAAGTCAT TGGTCATAAA 
GACTATACAG GGATTCGTTA TGATGATAAC TTTACGAATA TTGTTGGCAT GGAAGACTAC 
CATGATGGTA AATCGCCAAC CATTTCTGGC ATAGAAAAAG TCAATGATAA AGAAGTTAAA 
ATCACTTATA AAGAAGTTCA CCCAGGAATG CAACAATTAG GTGGCGGTGT TTGGGGCTCA 
GTTTTACCAA AACATGCCTT TGAAGGAATT GCTGTTAAAG ACATGGAATC AAGCGATGCA 
GTTCGTAAAA ACCCTGTGAC TATTGGACCA TACTACATGA GTAATATTGT GACAGGTGAA 
TCTGTTGAAT ACCTACCAAA TGAGCATTAC TACGGTGGTA AACCTAAATT AGATAAATTA 
GTGTTCAAAT CTGTTCCTTC TGCGAGCATT GTAGAAGCGA TGAAAGCGAA ACAATACGAT 
ATTGCATTAT CAATGCCAAC AGATACGTAT CCAACATACA AAGATACTGA AGGGTATCAA 
ATCTTAGGAC GTCCCGAACA AGCCTACACG TATATTGGCT TTAAAATGGG TACGTTTGAC 
AAAGAAACAA ATACAGTGAA ATACAATCCA AAAGCTAAAA TGGCAGATAA AAGCTTACGT 
CAAGCCATGG GCTATGCAAT TGACAATGAT GCAGTCGGCC AAAAATTCTA CAACGGCTTA 
CGAACAGGGG CAACAACGTT AATCCCACCA GTCTTCAAGA GCTTGCATGA TAGCGAAGCG 
AAAGGCTATA CGCTTGATTT AGACAAAGCG AAAAAATTAT TAGACGATGC TGGTTATAAA 
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GACGTAGACS GCGATG 3" A" ^-nr * - -7^ ? <- — , ^„ 

G— CAATHT "--^ A - — — —A AACCACTAGA AATCAAGTTT 

AAAGAAATTG T^ZZ? 8 ^ ATTACTATGT CCAACAATG3 

-AT-ATAAAT — «T«.„ „^- JOrt ...JCT TAATTGATT7 CCAAGCATTC 

-«T„ATAA«T .cuuiAArtTGA T3ACCCAGAA GTAGATATCT ATCAAGGCG" n-r-rnri 

GGTTCA3ATC CTTCACCAAC A "V- — ^ * ^ AAOCjCG^. GToGoGCACA 

^ T ..t-r — --C^CT CAGCCTTTAA ctatacacgt 

rrr^ iAC ^--a=tt -atgcgattg attcaaaagc atca—^at 

--^^GC CTTCTACGAT 7GGCAAGAGT ATGCCATT^ ^GAA-G— 

AC^OGGCaI SaUS A W ^S SS™" ACGACCG ™ AGTTGACTTT 

CGGAAA AoaTAmT.CA .^GcAACGG TGGGTGTCAC AGCAGACTCA 



EF04 5-4 ( SEQ ID NO : I 6 8 ) 

CGGGKKGGGV. KATETEDISK M? I A VKNDKK 
AIDGGTLDVA WMDTQFQGL FQQEFYQDNY 
LDEDANTATI KLRDNLKWSD GKDVTADDVI 

dgksptisg: ekvndkevki tykevhpgmq 

RKNPVTIGPY YMSNIVTGES VEYLPNEKYY 

ALSMPTDTYF TYKDTEGYQI LGRPEQAYTY 

AMGYAIDNDA VGQKFYNGLR TGATTLIPPV 

VDGDGIREDK EGKPLEIKFA SMSGGETAQP 

DKLKNDDPEV DIYQGAWGTG SDPSPTGLYG 

EKRKKAFYDW QEYAIDEAFV IPTLYRNEVL 
K 



E AQYKLPTVQ PLFNNDADFK IVDGGPADLK 
FSYEVIGKKD YTGIRYDDNF TNIVGMEDYH 
tLGGGVWGSV LPKHAFEGIA VKDMESSDAV 
GGKPKLDKLV FKSVPSASIV EAMKAKQYDI 
IGFKMGTFBK ETNTVKYNPK AKMADKSLRQ 
FKELHDSEAK GYTLDLDKAK KLLDDAGYKD 
LADYYVQQWK EIGLNVTYTT GRLIDFQAFY 
PNSAFNYTRF ESEENTKLLD AIDSKASFDE 
PVNDRWBFT WAVDTKDNPW ATVGVTADSR 



EF046-1 (SEQ ID NO:169) 

TAGGAGGATA TAATGAAAAA AAAACTTATT 
TGTAGTAATA ATACTGGGGG AAAAAATAGC 
CAGCAAACTA CCCAGTCTTC TAAAAAAGAT 
ACATCATCTA TAACAATTGA AACAACCGAG 
GATGATGTTT CAAAAACTAG ACGACAATTG 
ACGGATAAAG AAC TAAAGG A ATATATATCA 
AATTATATTA AGCAAAAA 



CTACTATTGT TAGCCTTATT TTTAACGGCA 
GACGCTTCAT CTACTGAAGT ATCAACTAAG 
AGTAGTAATC CGGACACAAC ACCAACTTCT 
AATTTAAAGA ATAGAGAATT GAATCCAACA 
TATGAACAAG GAATTAACAG TTCAACAATT 
GAGGCTAAAG AACAAAAGAA AGATGTCATT 



EF046-2 (SEQ ID N0:170) 

SSSten^kmp^ asstevstkq qttqsskkds snpdttptst 

yikqk lknrelnptd dvsktrrqly eqginsstit dkelkeyise akeqkkdvin 



YIKQK 

EF046-3 (SEQ ID NO: 171 
A 



r*™f ^ ATACTGGGGG AAAAAATAGC GACGCTTCAT CTACTGAAGT ATCAACTAAG 

GATGATGTTT CAAAAACTAG ACGACAATTG TATGAACAAG GAATTAA^aI SSSJSi 
iSSSSS iSSSST GAGGCTAAAG AACAAAAGAA SSESE 
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EF046-4 (SEQ ID NO: 172) 

C SNNTGGKNSD ASSTEVSTKQ QTTQSSKKDS SNPDTTPTST 

SSITIETTEN LKNRELNPTD DVSKTRRQLY EQGINSSTIT DKELKEYISE AKEQKKDVIN 
YIKQK 

EF047-1 (SEQ ID NC:173) 

TAGGGAAAAC AAGGAGGAAT TCTTATGAAA AAGATAGGGC TTATTTC TAG TGCTTTTCTT 
TTAACCCTTG CTTTAGCAGC ATGCGGCGGC GGAAAAAGTA CAGAAAATAC GGATAGTCGT 
TCCAGTGCTG CGGAAAGTAC CACAGTCGAG AGTACAAAAG CATCTGCTAC AAAAGAATCA 
AGTAGCAAAG CAACAACAAA ATCTAGTGAT GCGAAACCGT CAGGAACAAC AACAGCTGAT 
TCGAAAGCAA CAGCTTCTTC TACGAAGGAA GCGGCAAATA ATGGCTCAGC AGAGAAGCAA 
TCACCAGCGA AAAATGCGAA TCCAGATGAC CAAGCCAACC AAGTGCTTAA CCAGCTAGCA 
AACATGTTTC CTGGTCAAGG CTTACCGCAG GCAATTTTAA CGAGTCAAAC GAATAACTTT 
TTAACTGCAG CGACAACTTC ACAAGCGGAT CAAAACAATT TC CGTGTTTT ATATTATGCA 
GAAAAAGAAG CGATTCCAGT GAATGATGCA CGTGTCAATC AGTTAACGCC AATTAGTTCT 
TTTGAGAAAA AAA C ATATG G CTCTGATGCC GAAGCAAAAA ATGCAGTGAA CCAAATCATT 
GACAATGGCG GTCAACCAGT AGATTTAGGT TACAATATTA CTGGGTATAA ACAAGGGGCG 
GCAGGTTCTA GTTACTTATC TTGGCAAGAA GGCAATTGGA GTTTAGTCGT ACGGGCCTCA 
AATATCAATG GTGAATCGCC TGATGATTTA GCGAAAAATG TTGTCAACAT TTTGGAACAA 
GAAACATTAC CAGCACCGAA TACCGTTGGT CAAATCACAC TGAACGTGGC AGGAACCACT 
GACTATAATC GAAACTCAGT AGTTTGGCAA GCCGGTACAG TCGTTTACTC TGTCCATCAT 
TTTGACCCAA TTCAAGCAGT GAAGATGGCA ACATCAATGT AA 

EF047-2 (SEQ ID N0:174) 

MKK IGLISSAFLL TLALAACGGG KSTENTDSRS SAAESTTVES TKASATKESS 
SKATTKSSDA KPSGTTTADS KATASSTKEA ANNGSAEKQS PAKNANPDDQ ANQVLNQLAN 
MFPGQGLPQA ILTSQTNNFL TAATTSQADQ NNFRVLYYAE KEAI PVNDAR VNQLTPISSF 
EKKTYGSDAE AKNAVNQI ID NGGQPVDLGY NITGYKQGAA GSSYLSWQEG NWSLWRASN 
INGESPDDLA KNWKILEQE TLPAPNTVGQ ITLNVAGTTD YNRNSWWQA GTWYSVHHF 
DPIQAVKMAT SM 

EF047-3 (SEQ ID NO:175) 

ATGCGGCGGC GGAAAAAGTA CAGAAAATAC GGATAGTCGT 

TCCAGTGCTG CGGAAAGTAC CACAGTCGAG AGTACAAAAG CATCTGCTAC AAAAGAATCA 
AGTAGCAAAG CAACAACAAA ATCTAGTGAT GCGAAACCGT CAGGAACAAC AACAGCTGAT 
TCGAAAGCAA CAGCTTCTTC TACGAAGGAA GCGGCAAATA ATGGCTCAGC AGAGAAGCAA 
TCACCAGCGA AAAATGCGAA TCCAGATGAC CAAGCCAACC AAGTGCTTAA CCAGCTAGCA 
AACATGTTTC CTGGTCAAGG CTTACCGCAG GCAATTTTAA CGAGTCAAAC GAATAACTTT 
TTAACTGCAG CGACAACTTC ACAAGCGGAT CAAAACAATT TC CGTGTTTT ATATTATGCA 
GAAAAAGAAG CGATTCCAGT GAATGATGCA CGTGTCAATC AGTTAACGCC AATTAGTTCT 
TTTGAGAAAA AAACATATGG CTCTGATGCC GAAGCAAAAA ATGCAGTGAA CCAAATCATT 
GACAATGGCG GTCAACCAGT AGATTTAGGT TACAATATTA CTGGGTATAA ACAAGGGGCG 
GCAGGTTCTA GTTACTTATC TTGGCAAGAA GGCAATTGGA GTTTAGTCGT ACGGGCCTCA 
AATATCAATG GTGAATCGCC TGATGATTTA GCGAAAAATG TTGTCAACAT TTTGGAACAA 
GAAACATTAC CAGCACCGAA TACCGTTGGT CAAATCACAC TGAACGTGGC AGGAACCACT 
GACTATAATC GAAACTCAGT AGTTTGGCAA GCCGGTACAG TCGTTTACTC TGTCCATCAT 
TTTGACCCAA TTCAAGCAGT GAAGATGGCA ACATCAATGT AA 

EF047-4 (SEQ ID NO:176) 
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CGGG KETENTESP.S EAAEEI v " r VE£ TKASA.TKEEE 

SKATTKSECA KFSG7TTACS KATASSTEEA AICNGSAEKQS ?AKNANPCD£ ANQVLNQLAN 
MFPGQGLPQA I LTSQTMNFL TAATTSQAC 2 N^ERYLYYAE KEAIPVNDAR VNQLTPISSF 
EKKTYGSCAE AENAYNQ 1 1 D N T GGQ?YELGY NITGYKQGAA GSSYLSWQEG NWSLWRASN 
irCGESPDDEA XrcWNIDEQE TLPAPXTVGQ ITLNYAGTTO YNRNSWWQA GTWYSVHHF 
7 P I A T /HXA T £ M 



EF048-1 ( SEQ IE NO: 177) 

7AAGGAGAAA AGTTCATGAA AAAAAGAAAG GTTTTATTTA CAGCAGTTAT GGTATTGGCA 
GGATTACAGT TGCTAAGTGG TTGCGGCAAA ACAGAA jC7T CGGCAAATGA TACGGTAGTC 
TTGCGCTATG CGTATGCTAG TAA7AGCCAA CCAGTTATCG ATTCTATGAA GAAATTCGGT 
GAATTAGTAG AGGAAAAAAC AGATGGTAAA GTTCAAATTG AATATTTTCC AGATGGTCAA 
TTAGGAGGAG AAACAGAACT AATTGAATTA ACACAAACAG GTGCAATTGA TTTTGCAAAG 
GTCAGTGGAT CAGCATTAGA AAGTT7TTCT AAAGATTATT CTGTATTTGC CATTCCGTAT 
ATTTTTGATA ATGAAAAACA TTTTTTTAAA GTAATGGATA ATCAAGCGCT AATGCAACCA 
GTGTATGATT GTACAAAAAA ATTAGGATTT G7TGGTTTAA CTTATTATGA CTCTGGTCAA 
CGAAGTTTTT ATATGAGCAA AGGGCCTGTT ACATCTCCAG ATGATTTGAA AGGTAAAAAA 
ATTCGGGTCA TGCAAAGTGA AACCGCCATC AAAATG3TAG AACTTTTAGG GGGTTCGCCA 
GTACCTATGG GTAGTTCGGA AGTATATACT 7CTCTACAAT CTAATCTAAT CAACGGTGCA 
GAGAATAATG AGTTCGTTTT ATATACAGCT GGTCATGGTG GTGTGGCTAA GTATTATTCT 
TATGATGAGC ATACTCGAGT GCCAGATATT GTGATTATGA ACGAGGGAAC AAAAGAACGT 
TTGACAGCGA AAGAAGAACA AGCGATTGAA GAAGCAGCAA AAGAATCGAC CGCTTTTGAA 
AAAACGGTCT TTAAAGAAGC GGTTGAAGAA GAAAAGAAAA AAGCACAAGC AGAATATGGC 
GTTGTGTTCA ATCAAGTAGA CAGTGAACCA TTCCAAAAAC TTGTTCAACC GTTGCATGAA 
TCATTCAAAA ATAGCTCAGA ACATGGCGAA CTGTAT2AGG CTATTCGCCA GTTGGCGGAC 
TAA 

EF048-2 ( SEQ ID NO:178) 

MKKRKV LFTAVMVLAG LQLLSGCGKT EASANDTWL RYAYASNSQP VIDSMKKFGE 
LVEEKTDGKV Q IEYFPDGQL GGETELIELT QTGAIDFAKV SGSALESFSK DYSVFAIPYI 
FDNEKHFFKV MDNQALMQPV YDSTKKLGFV GLTYYDSGQR SFYMSKGPVT SPDDLKGKKI 
RVMQSETAIK MVELLGGSPV PMGSSEVYTS LQSNLINGAE NNEFVLYTAG HGGVAKYYSY 
DEHTRVPDIV IMNEGTKERL TAKQEQAIEE AAKESTAFEK TVFKEAVEEE KKKAQAEYGV 
VFNQVDSEPF QKLVQPLHES FKNSSEHGEL YQAIRQLAD 

EF048-3 (SEQ ID NO:179) 

TTGCGGCAAA ACAGAAGCTT CGGCAAATGA TACGGTAGTC 

TTGCGCTATG CGTATGCTAG TAATAGCCAA CCAGTTATCG ATTCTATGAA GAAATTCGGT 

GAATTAGTAG AGGAAAAAAC AGATGGTAAA GTTCAAATTG AATATTTTCC AGATGGTCAA 

TTAGGAGGAG AAACAGAACT AATTGAATTA ACACAAACAG GTGCAATTGA TTTTGCAAAG 

GTCAGTGGAT CAGCATTAGA AAGTTTTTCT AAAGATTATT CTGTATTTGC CATTCCGTAT 

ATTTTTGATA ATGAAAAACA TTTTTTTAAA GTAATGGATA ATCAAGCGCT AATGCAACCA 

GTGTATGATT CTACAAAAAA ATTAGGATTT GTTGGTTTAA CTTATTATGA CTCTGGTCAA 

CGAAGTTTTT ATATGAGCAA AGGGCCTGTT ACATCTCCAG ATGATTTGAA AGGTAAAAAA 

ATTCGGGTCA TGCAAAGTGA AACCGCCATC AAAA TGGTAG AACTTTTAGG GGGTTCGCCA 

GTACCTATGG GTAGTTCGGA AGTATATACT TCTCTACAAT CTAATCTAAT CAACGGTGCA 

GAGAATAATG AGTTCGTTTT ATATACAGCT GGTCATGGTG GTGTGGCTAA GTATTATTCT 

TATGATGAGC ATACTCGAGT GCCAGATATT GTGATTATGA ACGAGGGAAC AAAAGAACGT 
TTGACAGCGA AACAAGAACA AGCGATTGAA GAAGCAGCAA AAGAATCGAC CGCTTTTGAA 
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AAAACGGTCT TTAAAGAAGC GGTTGAAGAA GAAAAGAAAA AAGCACAAGC AGAATATGGC 
GTTGTGTTCA ATCAAGTAGA CAGTGAACCA TTCCAAAAAC TTGTTCAACC GTTGCATGAA 
TCATTCAAAA ATAGCTCAGA ACATGGCGAA CTGTATCAGG CTATTCGCCA GTTGGCGGAC 
TAA 

EF048-4 ( SEQ ID NO:180) 

CGKT EASANDTWL RYAYASNSQP VIDSMKKFGE 

LVEEKTDGKV QIEYFPDGQL GGETELIELT QTGAIDFAX.V SGSALESFSK DYSVFAIPYI 
FDNEKHFFKV MDNQALMQPV YDSTKKLGFV GLTYYDSGC'R SFYMSKGPVT SPDDLKGKKI 
RVMQSETAIK MVELLGGSPV PMGSSEVYTS LQSNLINGAE NNEFVLYTAG HGGVAKYYSY 
DEHTRVPDIV IMNEGTKERL TAKQEQAIEE AAKESTAFEK TVFKEAVEEE KKKAQAEYGV 
VFNQVDSEPF QKLVQPLHES FKNSSEHGEL YQAIRQLAD 



EF049-1 ( SEQ ID NO: 181) 

TGAGACTCTT TCTTTTTCAA AATGAGGTAT GGTATAGTTA TAACAGANAT AAAACTANAA 
AAAACAGGAG TGCATAAGAG AATGAAGAAA AAACTAATCT TAGCTGCAGC GGG CGCAATG 
GCCGTTTTTA GTTTAGCAGC GTGTTCAAGC GGTTCAAAAG ATATCGCAAC AATGAAAGGT 
TCAACAATTA CTGTTGATGA TTTTTATAAC CAAATTAAAG AACAAAGCAC TAGCCAACAA 
GCGTTTAGCC AAATGGTTAT TTATAAAGTC TTTGAAGAAA AATATGGCGA CAAAGTAACT 
GACAAAGANA TTCAAAAAAA CTTTGACGAA GCCAAAGAAC AAG TAG AAGC ACAAGGCGGA 
AAGTTCTCTG ATGCATTAAA ACAAGCTGGT TTAACTGAAA AAACATTCAA GAAACAGTTA 
AAACAAAGAG CAGCCTATGA TGCAGGTCTA AAAGCCCACT TAAAAATTAC AGATGAAGAC 
TTAAAAACAG CTTGGGCAAG TTTCCATCCA GAAGTAGAAG CACAAATTAT CCAAGTTGCT 
TCAGAAGATG ATGCCAAAGC TGTCAAGAAA GAAATCACTG ACGGCGGCGA TTTCACAAAA 
ATTGCTAAAG AAAAATCAAC AGATACTGCT ACGAAAAAAG ATGGCGGTAA AATTAAATTT 
GATTCACAAG CAACAACTGT TCCTGCCGAA GTTAAAGAAG CTGCCTTCAA ATTAAAAGAT 
GGCGAAGTGT CAGAACCAAT TGCTGCAACA AATATGCAAA CCTACCAAAC AACCTACTAT 
GTAGTGAAAA TGACGAAAAA CAAAGCAAAA GGCAATGACA TGAAACCTTA TGAAAAAGAG 
ATCAAGAAAA TTGCTGAAGA AACAAAATTA GCCGATCAAA CATTTGTTTC GAAAGTCATT 
AGTGACGAAT TAAAAGCGGC CAATGTGAAA ATTAAAGATG ATGCCTTCAA GAACGCTTTA 
GCAGGCTACA TGCAAACTGA ATCTTCAAGC GCTTCTTCAG AGAAAAAAGA ATCAAAATCA 
AGTGATTCTA AAACAAGCGA TACCAAAACA AGCGACTCTG AAAAAGCAAC AGATTCTTCA 
AGCAAAACAA CAGAATCTTC TTCTAAATAA 

EF049-2 (SEQ ID NO:182) 

MKKK LILAAAGAMA VFSLAACSSG SKDIATMKGS 
TITVDDFYNQ IKEQSTSQQA FSQMVIYKVF EEKYGDKVTD 
FSDALKQAGL TEKTFKKQLK QRAAYDAGLK AHLKITDEDL 
EDDAKAVKKE ITDGGDFTKI AKEKSTDTAT KKDGGKIKFD 
EVSEPIAATN MQTYQTTYYV VKMTKNKAKG NDMKPYEKEI 
DELKAANVKI KDDAFKNALA GYMQTESSSA SSEKKESKSS 
KTTESSSK 

EF049-3 { SEQ ID NO:183) 

GTGTTCAAGC GGTTCAAAAG ATATCGCAAC AATGAAAGGT 
TCAACAATTA CTGTTGATGA TTTTTATAAC CAAATTAAAG AACAAAGCAC TAGCCAACAA 
GCGTTTAGCC AAATGGTTAT TTATAAAGTC TTTGAAGAAA AATATGGCGA CAAAGTAACT 
GACAAAGANA TTCAAAAAAA CTTTGACGAA GCCAAAGAAC AAGTAGAAGC ACAAGGCGGA 
AAGTTCTCTG ATGCATTAAA ACAAGCTGGT TTAACTGAAA AAACATTCAA GAAACAGTTA 



KXIQKNFDEA KEQVEAQGGK 
KTAWASFHPE VEAQIIQVAS 
SQATTVPAEV KEAAFKLKDG 
KKIAEETKLA DQTFVSKVIS 
DSKTSDTKTS DSEKATDSSS 
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AAA Z AAAG A G CAGCCTATGA TGCAGGTCTA AAAGCCCACT TAAAAATTAC A 3ATGAAGAC 

TTAAAAACAG GTTGGGCAAG ^^TCCA.TCCA G.-~!-iG _.nGAAG CACmAaTTAT ^ „ AhGTTG^ * 

TCA 3AAGATG ATGCCAAA3C TGTCAAGAAA GAAATCACTG ACGGCGGCGA TTTCACAAAA 

ATTGCTAAAG AAAAATCAAC AGATACTGC7 ACGAAAAAAG ATGGCGGTAA AATTAAATTT 

GATTCACAAG CAACAACT37 TCC TGGCGAA GTTAAAGAAG CTGCCTTCAA ATTAAAAGAT 

GGC3AAGTGT CAGAACCAAT TGCTGCAACA AATATGCAAA CCTACCAAAC AAGCTACTAT 

GTA37GAAAA TGACGAAAAA CAAAGCAAAA GGGAATGACA TGAAACCTTA TGAAAAAGAG 

ATCAAGAAAA TTGCTGAAGA AACAAAATTA GCCGATCAAA CATTTGTTTC GAAAGTCATT 

AGTGACGAAT TAAAAGCGG 2 CAATGTGAAA ATTAAAGATG ATGCCTTCAA GAACGCTTTA 

GCAGGCTACA TGCAAACT3A ATCTTCAAGG GCTTTTTCAG AGAAAAAAGA ATCAAAATCA 

AGTGATTCTA AAACAAGCGA TACCAAAACA AGCGAGTCTG AAAAAGCAAG AGATTCTTCA 
AGCAAAACAA CAGAATCTT: TTCTAAATAA 

EF049-4 (SEQ ID NO:184) 

CSSG SKDIATMKGS 

TITVDDFYNQ IKEQSTSQQA F S £ MV I YK V F EEKYGDKVTD KXIQKKFDEA KEQVEAQGGK 
FSDALKQAGL TEKTFKKQLK QRAAYDAGLK AHLKITDECL KTAWASFHPE VEAQIIQVAS 
eddakavkp;e ITDGGDFTKI AKEKSTDTAT KKDGGKIKFD SQATTVPAEV KEAAFKLKDG 
EVSEPIAATN MQTYQTTYYV VKMTKNKAKG NDMXPYEKEI KKIAEETKLA DQTFVSKVIS 
DELKAANVKI KDDAFKNALA GYMQTESSSA SSEKKESKSS DSKTSDTKTS DSEKATDSSS 
KTTESSSK 



EF050-1 ( SEQ ID NO:185) 

TAGGGTCTGG AAAAGCAGTC AACTGACTTG TTTTCCAAGC CCTTTTTTAG TTCATCGCAG 
AAAGGATGNA AAAAAATGAA CATGCCCAAA AATA7CNGTT ATTTTTCTTT GCTAATGGGT 
CTTGTTCTAT TATTAAGTGC TTGCCAAATT GGGGCAACTA CGAAGGATGA CAACCAAGCC 
GCCACAAAAG AAGCAACTGT TGAGTTAAAC CGCACAACAA CACCAACGCT TTTTTTTC A T 
GGTTACGCAG GAACTAAAAA TTCGTTTGGC TCGTTACTGC ATCGCTTGGA GAAACAAGGT 
GCCACAACTC AAGAATTAGT GCTACTCGTT AAACCTGATG GGACCGTGGT TAAAGAGCGA 
GGAGCTTTAA GTGGCAAAGC GACGAATCCC AGTGTTCAAG TTCTATTTGA A G ATAATAAA 
AACAATGAAT GGAATCAAAC AGAATGGATA AAAAACACAT TACTCTATTT ACAAAAAAAT 
TATCAAGTGA ACAAAGCCAA TATTGTCGGG CACTCTATGG GTGGTGTTAG TGGTTTACGT 
TATTTAGGAA CCTATGGGCA AGATACATCG TTACCTAAAA TTGAAAAATT CGTCAGCATT 
GGAGCACCTT TCAATGATTT TATTGATACG AGTCAACAGC AAACCATCGA AACGGAACTA 
GAAAACGGCC CCACAGAAAA AAGTAGCCGC TATTTGGATT ATCAAGAGAT GATTAATGTT 
GTTCCAGAAA AACTGCCCAT TTTATTAATT GGTGGTCAAT TAAGTCCAAC AGATTTAAGT 
GATGGAACGG TGCCGTTATC TAGTGCCTTA GCAGTCAACG CCTTGCTAAG ACAGCGAGGA 
ACTCAAGTCA CTAGCCAGAT TATTAAAGGA GAAAATGCAC AACATAGTCA ATTACATGAA 
AATCCTGAAG TAGATCAATT GCTAATCGAA TTTCTATGGC CGAGTAAAAA AT AG 

EF050-2 ( SEQ ID NO:186) 

MNMPKN IXYFSLLMGL VLLLSACQIG ATTKDDNQAA 

TKEATVELNR TTTPTLFFHG YAGTKNSFGS LLHRLEKQGA TTQELVLLVK PDGTWKERG 

ALSGKATNPS VQVLFEDNKN NEWNQTEWIK NTLLYLQKNY QVNKANIVGH SMGGVSGLRY 

LGTYGQDTSL PKIEKFVSIG APFNDFIDTS QQQTIETELE NGPTEKSSRY LDYQEMINW 

PEKLPILLIG GQLSPTDLSD GTVPLSSALA VNALLRQRGT QVTSQIIKGE NAQHSQLHEN 
PEVDQLLIEF LWPSKK 



EF050-3 ( SEQ ID NO:187) 
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TTGCCAAATT GGGGCAACTA 

GCC ACAAAAG AAGCAACTGT 

GGTTACGCAG GAACTAAAAA 

GCCACAACTC AAGAATTAGT 

GGAGCTTTAA GTGGCAAAGC 

AACAATGAAT GGAATCAAAC 

TATCAAGTGA ACAAAGCCAA 

TATTTAGGAA CCTATGGGCA 

GGAGCACCTT TCAATGATTT 

GAAAACGGCC CCACAGAAAA 

GTTCCAGAAA AACTGCCCAT 

GATGGAACGG TGCCGTTATC 

ACTCAAGTCA CTAGCCAGAT 

AATCCTGAAG TAGATCAATT 



CGAAGGATGA CAACCAAGCC 
TGAGTTAAAC CGCACAACAA 
TTCGTTTGGC TCGTTACTGC 
GCTACTCGTT AAACCTGATG 
GACGAATCCC AGTGTTCAAG 
AGAATGGATA AAAAACACAT 
TATTGTCGGG CACTCTATGG 
AGATACATCG TTACCTAAAA 
TATTGATACG AGTCAACAGC 
AAGTAGCCGC TATTTGGATT 
TTTATTAATT GGTGGTCAAT 
TAGTGCCTTA GCAGTCAACG 
TATTAAAGGA GAAAATGCAC 
GCTAATCGAA TTTCTATGGC 



CACCAACGCT TTTTTTTCAT 
ATCGCTTGGA GAAACAAGGT 
GGACCGTGGT TAAAGAGCGA 
TTCTATTTGA AGATAATAAA 
TACTCTATTT ACAAAAAAAT 
GTGGTGTTAG TGGTTTACGT 
TTGAAAAATT CGTCAGCATT 
AAACCATCGA AACGGAACTA 
ATCAAGAGAT GATTAATGTT 
TAAGTCCAAC AGATTTAAGT 
CCTTGCTAAG ACAGCGAGGA 
AACATAGTCA ATTACATGAA 
CGAGTAAAAA ATAG 



EF050-4 (SEQ ID NO;188) 



CQIG ATTKDDNQAA 
TKEATVELNR TTTPTLFFHG 
ALSGKATNPS VQVLFEDNKN 
LGTYGQDTSL PKIEKFVSIG 
PEKLPILLIG GQLSPTDLSD 
PEVDQLLIEF LWPSKK 



YAGTKNSFGS LLHRLEKQGA 
NEWNQTEWIK NTLLYLQKNY 
APFNDFIDTS QQQTIETELE 
GTVPLSSALA VNALLRQRGT 



TTQELVLLVK PDGTWKERG 
QVNKANIVGH SMGGVSGLRY 
NGPTEKSSRY LDYQEMINW 
QVTSQI IKGE NAQHSQLHEN 



EF051-1 (SEQ ID NO: 189) 

TAAAAGAAAA GAGGCGTTCA AATGTCTAAA CAAAAAAAGG CTGTGTTCCT GCTTAGTTTA 
TTCAGTTTAG TTGCCCTAAT TGCTGCATGT ACAAATCAGC CGCAAAAAGA AACAGTTTCA 
ACAAAAAAAG AAGAAATAAC CCTTGCGGCA GCAGCTAGCT TAGAATCAGT CATGGAGAAG 
AAAATTATTC CAGCCTTTGA AAAAGAGCAT CCAGATATTC AGGTAACTGG AACCTATGAT 
AGTTCTGGAA AATTACAGAT GCAAATTGAA AAAGGCCTAA AAGCCGATGT ATTTTTCTCA 
GCTTCGACAA AACAAATGAA TGCATTGGTT GCAGAAAAAC TAATTAATAA AAAAAGTGTC 
GTTCCTTTAT TGGAAAACCA GCTCGTTCTT ATTGTGCCTA ACCAAGATCA AGCAAAGTGG 
CATGATTTTT CTGATTTAAA AAAAGCCCAA ATGATAGCAA TTGGTGATCC TGCAAGTGTT 
CCAGCTGGTC AATATGCCGA AGAAGGCTTA AAAGCTTTAG GCGCTTGGTC TTATGTAGAA 
AAACACGCAA GCTTTGGCAC GAATGTAACA GAAGTCCTTG AATGGGTAGC TAATGCAAGT 
GCAGAAGCTG GCTTAGTTTA TGCGACAGAT GCAGCAACCA ATTCAAAAGT AGCGATTGTT 
GCGGCCATGC CTGAAGCTGT TTTGAAAAAG CCAATTATCT ATCCAGTTGG TAAAGTTGCC 
GCCTCTAAGA AACAAAAATC AGCAGATGCT TTTTTAAATT TTTTACAGAG TCAACAATGC 
AGAAAATATT TTGANAATAT TGGCTTTAAG TTAACAAAGT AG 

EF051-2 ( SEQ ID NO:190) 

MSKQ KKAVFLLSLF SLVALIAACT NQPQKETVST KKEEITLAAA ASLESVMEKK 
I IPAFEKEHP DIQVTGTYDS SGKLQMQIEK GLKADVFFSA STKQMNALVA EKLINKKSW 
PLLENQLVLI VPNQDQAKWH DFSDLKKAQM IAIGDPASVP AGQYAEEGLK AL.GAWSYVEK 
HASFGTNVTE VLEWVANASA EAGLVYATDA ATNSKVAIVA AMPEAVLKKP IIYPVGKVAA 
SKKQKSADAF LNFLQSQQCR KYFXNIGFKL TK 

EF051-3 (SEQ ID NO:191) 

ATGT ACAAATCAGC CGCAAAAAGA AACAGTTTCA 

ACAAAAAAAG AAGAAATAAC CCTTGCGGCA GCAGCTAGCT TAGAATCAGT CATGGAGAAG 
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AAAATTATTC CAGCCTTTGA AAAAGAGCA7 CC AGA7AT7C AGGTAACTGG AACCTATGAT 
AGTTCTGGAA AA.TTA TAG AT GCAAA77GAA AAAGGGCTAA AAGiXGATGT AT77TTC7CA 
GCTTCGACAA AACAAA7GAA 7GCATTGG77 GGAGAAAAAC TAATTAATAA AAAAAGTGTC 
GTTCC7TTA7 TGGAAAACCA GCTCGT7CTT ATTGTGCCTA AGCAAGATCA AG C AAAGTG G 
CA7GA77777 CTGAT77AAA AAAAGCGCAA A7GA7AGCAA 77GGTGATCC TGCAAGTGTT 
■rcAGcrGGTC aatatgcgga agaasg:7TA AAA2CT7TAG GCGCTTGGTC TTATGTAGAA 

AAACACGCAA GC7TTGGCAC GAATG7AACA GAA37CC77G AATGGGTAGC T.kATG^AAG ^ 

GCAGAAGCTG GCTTAGTTTA TGCGAGAGAT GCAGCAACCA ATTCAAAAG7 AGCGATTGTT 

GGGGCCATGC CTGAAGCTGT TTTGAAAAAG CCAATTATCT ATCCAGTTGG TAAAGTTGCC 

GGCTC7AAGA AAGAAAAATC AGCAGATGC7 7777TAAATT TTTTACAGAG TCAACAATGC 
AGAAAATATT 7TGAMAATA7 7GGCTTTAAG TTAACAAAG7 AG 

EF051-4 ( SEQ ID wC : 192) 

C7 N£?QKETVST KKEEITLAAA ASLESVMEKK 

1 1 PAFEKEHP d:qv7gtyds sgklqmqiek glkaeyffsa STKQMNALVA eklinkksw 

PLLENQLVLI VPNQDQAKWH DFSDLKKAQM IAIGDPASVP AGQYAEEGLK ALGAWSYVEK 
KASFGTNVTE VLEWVAMASA EAGLVYA7DA ATNSXVAIVA AMPEAVLKKP IIYPVGKVAA 
SKKQKSADAF LNFL.QSQQCR KYFXNIGFKL TK 



EF052-1 (SEQ ID NO:193) 

7AAAGTAGGA GAAGCGCAAG CGAAAAAAGT GAATCAA7CG GCAGCGTA7C AAGTAGTGAT 
CCCACAA7GG GTACCATGGG TAGCATTATC T77GACAG7A GCACTTGC7G GATTGATTGC 
TTACTTAG7T CGTCGTGGAG AGAAGTGGAA AAACGAAGGG GAAGTGACAT AATGAGANGA 
NGAAATCTTC NG77TTTATT ATTGTTGGT7 CTA77AATTT ATATTCCTCA AACAACTTAT 
GCAGAAAA7A GGGAGACCAC AGAAGTCGGA A7CGGG7T7A CAAAAACT7C AGACATACCA 
TCAAAAAAAA A7CCAGTTG7 GAA7GTAT7G CCGCAAACAA CCATTCAATC GCTATCAATC 
GTTGGTAGCA GAACGCAAA7 AAAAAGAT7A CCTAAAAC7G G7GACAATCG AATAAC7TGG 
CTAAGCTGGT T7GGCATA7T GT777TAATA AGTAG7TTTT GGCTGTTTCT ATTTAGACAA 
T7ATGTAGAA AAGGAGAATA A 

EF052-2 (SEQ ID NO:194) 

MRXX 

NLXFLLLLVL LIYIPQTTYA ENRETTEVGI GFTK7SDIPS KKNPWNVLP QTTIQSLSIV 
RSRTQIKRLP K7GDNRITWL SWFGILFLIS SFWLFLFRQL CRKGE 

EF052-3 ( SEQ ID NO:195) 

AGAAAATA GGGAGACCAC AGAAGTCGGA ATCGGG7TTA CAAAAACTTC AGACATACCA 
TCAAAAAAAA ATCCAGTTGT GAATGTATTG CCGCAAACAA CCATTCAATC GCTATCAATC 
GTTCGTAGCA GAACGCAAAT AAAAAGAT 

EF052-4 (SEQ ID NO:196) 

ENRETTEVGI GFTKTSDIPS KKNPWNVLP QTTIQSLSIV 
RSRTQIKR 



EF053-1 (SEQ ID NO: 197 ) 

TAGTCATGGC ACCATAACAA GGAGGAGAGA AGTGAGA7GA AAAAATACC7 TTTGCTTAGT 
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TGTTTTTTAG GTCTTTTCAG CTTCTGTCAT TCAGACACTG CGTTTGGAGA AGCAGCTTAT 
G AAAA TAG TG GTGTTGTCTC CTTTTATGGA ACGTATGAAT ATCCCACAGA AGAGTCGACA 
ACAGCGACTA GTAATTCTTC CACAACGACC GAACCCACCA AGCCAGCTGA CGGAGGCGCT 
TCATCCGTCC TTTCTTCTGG CGTATATGGA TCGCGACAAG GAAGATTACC AGCGACAGGT 
ACCACCAATC AAGCACCATT TATTTATTTG GGAATCAGCC TTATCACTAT AGGCATATTA 
TTTATTAAAA GGAGAAGAGA AGATGAAAAA AACAGTATTA GCAGTAGTAG GGATTGTAGG 
ATTTAG 

EF0B3-2 (SEQ ID NO:198) 

MKKYLLLSC FLGLFSFCHS DTAFGEAAYE NSGWSFYGT YEYPTEESTT 

ATSNSSTTTE PTKPADGGAS SVLSSGVYGS RQGRLPATGT TNQAPFIYLG ISLITIGILF 

IKRRREDEKN SISSSRDCRI 

EF053-3 (SEQ ID NO:199) 

TTTGGAGA AGCAGCTTAT 

GAAAATAGTG GTGTTGTCTC CTTTTATGGA ACGTATGAAT ATCCCACAGA AGAGTCGACA 
ACAGCGACTA GTAATTCTTC CACAACGACC GAACCCACCA AGCCAGCTGA CGGAGGCGCT 
TCATCCGTCC TTTCTTCTGG CGTATATGGA TCGCGACAAG GAAGA 

EF053-4 (SEQ ID NO:200) 

FGEAAYE NSGWSFYGT YEYPTEESTT 
ATSNSSTTTE PTKPADGGAS SVLSSGVYGS RQGR 



EF054-1 (SEQ ID NO:201) 

TAAATAAAAA ATTATTTGGA GGAAATTACA ATGAAAAAAA TTATTTTATC AAGCTTGTTT 
AGTGCAGTAC TAGTATTCGG TGGCGGAAGT ATAACAGCAT TCGCTGACGA TTTAGGACCA 
ACAGATCCAG CAACTCCACC AATTACCGAA CCAACTGATT CTAGTGAACC TACGAATCCT 
ACTGAGCCGG TGGATCCTGC AGAACCGCCA GTAATACCAA CTGATCCAAC AGAACCAAGC 
AAGCCAACCG AGCCTACAAC A C C GAG TG AG CCAGAAAAGC CAACAGAACC AACAACGCCA 
ATTGATCCTG GAACGCCGGT TGAACCGACT GAACCAAGCG AGCCAACAGA ACCTAGTCAA 
CCAACCGAGC CTACAACACC AAGCGAAC C A GAAAAACCTG TTACTCCAGA ACAACCGAAA 
GAACCAACTC AACCAGTGAT TCCAGAAAAA CCAGCAGAAC CAGAAACACC AAAAACTCCT 
GAACAGCCCA CTAAACCAAT AGACGTAGTC GTTACACCTA GTGGAGAAAT TGATAAAACG 
AATCAATCGG CAGGAACACA ACCAAGTATT CCTATTGAAA CAAGCAACTT AGCGGAGGTA 
ACACATGTAC CAAGTGAAAC TACTCCAATT ACAACAGAAG CTGGGGAAGA AATTGTAGCA 
G TAG ATAAAG GTGTTCCGTT AACCAAAACA CCAGAAGGAT TAAAACCAAT TAGCAGCTCG 
TATAAGGTTT TACCTAGCGG AAACGTTGAG GTAAAAGCAA GTGATGGAAA AATGAAAGTA 
TTGCCACATA CAGGAGAGAA ATTCACACTC CTTTTCTCTG TATTGGGAAG CTTCTTTGTA 
TTAATTTCAG GATTCTTTTT CTTTAAAAAG AATAAGAAAA AAGCTTAA 

EF054-2 (SEQ ID NO:202) 

M KKIILSSLFS AVLVFGGGSI TAFADDLGPT DPATPPITEP TDSSEPTNPT 
EPVDPAEPPV IPTDPTEPSK PTEPTTPSEP EKPTEPTTPI DPGTPVEPTE PSEPTEPSQP 
TEPTTPSEPE KPVTPEQPKE PTQPVIPEKP AEPETPKTPE QPTKPIDVW TPSGEIDKTN 
QSAGTQPSIP IETSNLAEVT HVPSETTPIT TEAGEEIVAV DKGVPLTKTP EGLKPISSSY 
KVLPSGNVEV KASDGKMKVL PHTGEKFTLL FSVLGSFFVL ISGFFFFKKN KKKA 



EF054-3 (SEQ ID NO:203) 
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A 

ACAGATCCAG CAACTCCACC AATTACZGAA CCAACTGATT CTAGTGAACC TACGAATCCT 
ACTGAGCCGG TGGATCCTGC AGAACC 3CCA GTAATAZCAA :TGATCCAAC AGAACCAAGC 
AAG CCAACCG AGCCTACAAC ACCGAGTGAG 2 C AG AAAAG C ZAACAGAACC AACAACGCCA 
ATT3ATCCTG GAACGCCGGT TGAACCGACT GAACCAAGCG AGCCAACAGA ACCTAGTCAA 
CCAACCGAGC CTACAACACC AAGCGAACCA GAAAAj\ Z CTG TTACTCCAGA ACAACCGAAA 
GAACCAACTC AACCAGTGAT TZCAGAAAAA CCAGCAGAAC CAGAAACACC AAAAACTCCT 
GAAGAGCCCA CTAAACCAAT AGACGTAGTC GTTACAZCTA 3TGGAGAAAT TGATAAAACG 
AATCAATCGG CAGGAACACA ACCAAGTATT GCTATTGAAA CAAGCAACTT AGCGGAGGTA 
ACACATGTAC CAAGTGAAAC TACTCCAATT ACAACAGAAG CTGGGGAAGA AATTGTAGCA 
GTAGATAAAG GTGTTCCGTT AACCAAAACA CCAGAA33AT TAAAACCAAT TAGCAGCTCG 
TATAAGGTTT TACCTAGCGG AAACGTTGAG GTAAAA3GAA G TG ATGG AAA AATGAAAGTA 
T 

EF0S4-4 ( SEQ ID NO:204) 
DDLGPT DPATPPITEP TCSSEPTNPT 

EPVDPAEPPV IPTDPTEPSK PTEPTTPSEP EKPTEr TTPI DPGTPVEPTE PSEPTEPSQP 
TEPTTPSEPE KPVTPEQPKE PTQPVIPEKP AEPETrKTPE QPTKPIDVW TPSGEIDKTN 
QSAGTQPSIP IETSNLAEVT HVPSETTPIT TEAGEEI VAV DKGVPLTKTP EGLKPISSSY 
KVLPSGNVEV KASDGKMKV 



EF055-1 {SEQ ID NO:205) 

TAACAAAAGG TTGTTTTGTC TTTCTTGTGT AAAAGGGCAA GAAAGGCTAG CG AG TTAAAA 

ggaggttttt CAATGAAAAA AAAGCGTTAT TTAATGA7TG tgtgtctact atcttctcct 

AGTTTTTTTA TAAATGTTGA AGCGTCTGAT GGTGGTTCTA GTTCGGTGGG GATTGAATTT 

TACCAAAATC CGAGAACACC CGCTCCTAAA GATCCCCCAC CGAAAACAGA TGCGCCAGCT 

GGTGATCCCA AGGAACCAGC TGGTCCTCCG CAAGGAGATC AACGAAGTGG TGGTTCGACA 

CAGACCACCA CAACTGGCTC AACGCTCCCT CGTACAGGGA GCAAGAGTCA GGCAAATTTG 
AGCATTCTCN GNTTCGCCTT AATCGGTTTG GCGGGAATCG TACATAGAAA GAAGGGACGA 
CATGAAGCAA ACTAA 

EF055-2 (SEQ ID NO:206) 

MKKKRYL MIVCLLSSPS FFINVEASDG GSSSVGIEFV 

QNPRTPAPKD PPPKTDAPAA DPKEPAGPPQ GDQRSGGSTQ TTTTGSTLPR TGSKSQANLS 
ILXFALIGLA GIVHRKKGRH EAN 

EF055-3 (SEQ ID NO:207) 

AGCGTCTGAT GGTGGTTCTA GTTCGGTGGG GATTGAATTT 

TACCAAAATC CGAGAACACC CGCTCCTAAA GATCCCCCAC CGAAAACAGA TGCGCCAGCT 
GCTGATCCCA AGGAACCAGC TGGTCCTCCG CAAGGAGATC AACGAAGTGG TGGTTCGACA 
CAGACCACCA CAACTGGCTC AACG 



EF055-4 (SEQ ID NO:208) 
SDG GSSSVGIEFY 

QNPRTPAPKD PPPKTDAPAA DPKEPAGPPQ GDQRSGGSTQ TTTTGST 
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EF056-1 (SEQ ID NO:209) 

TAAATGAAAA AAAAGCGTTA TTTAATAATT GCGTGTTTAC 
ATAAATGTTG AAGCATCTGA GGGTGGTTCT AGTTCGGTGG 
CCGGCAACAC CCGCTCCTAA AGATGCCCCA CCGAAAACAG 
AAGGAACCAG CTGGTCCTCT GCAAGGAGAT CAACGAAGTG 
ACAGCTGGCT CGCAGCTCCC TCGTACAGGA AGCAAGAGTC 
GGTCTTGTCT TGATTGGTCT TGTCGGAATG GTCCAGAGAA 
AACTAA 

EF056-2 (SEQ ID NO:210) 

MKKKRYLIIA CLLFSPSFFI NVEASEGGSS SVGIEFYQNP ATPAPKDAPP KTDEPAADPK 
EPAGPLQGDQ RSGGSTQTTT AGSQLPRTGS KSQANLSILG LVLIGLVGMV QRKKGRHEAN 

EF056-3 (SEQ ID NO:211) 

ATCTGA GGGTGGTTCT AGTTCGGTGG GAATTGAATT TTACCAAAAT 

CCGGCAACAC CCGCTCCTAA AGATGCCCCA CCGAAAACAG ATGAGCCAGC TGCGGATCCC 
AAGGAACCAG CTGGTCCTCT GCAAGGAGAT CAACGAAGTG GTGGTTCGAC ACAGACCACC 
ACAGCTGGCT CGCAG 

EF056-4 (SEQ ID NO:212) 

SEGGSS SVGIEFYQNP ATPAPKDAPP KTDEPAADPK 
EPAGPLQGDQ RSGGSTQTTT AGSQ 

EF057-1 (SEQ ID NO:213) 

TAATGTTTAT TGGCTGGGCC AGTCAATGTT GAAAATGGGG AAGGAGGAAT TCAGATGAAA 

ATCATAAAAA GGTTTAGTTT GGTATGTTTA GGGCTATTGA TCATTGGGTT GCNAACAAAA 

AGCGNTATGG CTGAAGAAAA TAATTATGAA TCAAATGGTC AAGCGAGCTT CTATGGTACC 

TACGTTTATG AGAATGAAAA AG AG TC AAAT GACGTAGCGT ATACCCAACA ATCAGAAGAA 

CAGGGAAGAA ACAATTTAGC TGCTTCTGGA CAAGCAGTTT TACCTAAAAC AGGCGAGTCT 

GAAAATCCGC TGTATTCCTT GATAGGAGTT AGTTTGTTGG GGATAGTCAT TTATTTAATT 

AATAAAATGA AACGAGAGAA GGAGTTTATT TAA 

EF057-2 (SEQ ID NO:214) 

MKI IKRFSLVCLG LLI IGLXTKS XMAEENNYES NGQASFYGTY 

VYENEKESND VAYTQQSEEQ GRNNLAASGQ AVLPKTGESE NPLYSLIGVS LLGIVIYLIN 
KMKREKEFI 

EF057-3 (SEQ ID NO:215) 

AAA TAATTATGAA TCAAATGGTC AAGCGAGCTT CTATGGTACC 

TACGTTTATG AGAATGAAAA AGAGTCAAAT GACGTAGCGT ATACCCAACA ATCAGAAGAA 
CAGGGAAGAA ACAATTTAGC TGCTTCTGGA CAAGCAGTTT 

EF057-4 (SEQ ID NO:216) 

EENNYES NGQASFYGTY 

VYENEKESND VAYTQQSEEQ GRNNLAASGQ AV 



TATTTTCCCC 
GAATTGAATT 
ATGAGCCAGC 
GTGGTTCGAC 
AGGCAAACCT 
AGAAGGGACG 



TAGTTTTTTT 
TTACCAAAAT 
TGCGGATCCC 
ACAGACCACC 
GAGCATTCTT 
ACATGAAGCA 
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EF058-1 (SEC 13 NO: 217) 

-GAAGAACGT TCTATTTGGT TGACGATTGC AGGCC7GCTA ATCAT7GGGA TGGTAGTCAT 
TTGGC-ATTT TATCAAAAAC AAAAAAGAGG AGAGAGAAAA TGAAGCAATT AAAAAAAGTT 
TGGTACACCG TTAGTACCT7 GTTACTAATT TTGCCACTTT TCACAAGTGT ATTAGGGACA 
ACAACTGCAT TTGCAGAAGA AAATGGGGAG AGC3CACAGC TCGTGATTCA CAAAAAGAAA 
A^GACGGATT TACCAGATCC GCTTATTCAA AATAGCGGGA AAGAAATGAG CGAGTTTGAT 
AAATATCAAG GACTGGCAGA TGTGACGTTT AGTATTTATA ACGTGACGAA CGAATTTTAC 
GAGCAACGAG CGGCAGGCGC AAGCGTTGAT 3CAGCTAAAC AAGCTGTCCA AAGTTTAACT 
C-TGGGAAAC CTGTTGCTCA AGGAACCACC 3ATGCAAATG GGAATGTCAC TGTTCAGTTA 
CC-AAAAAAC AAAATGGTAA AGATGCAGTG TATACCATTA AAGAAGAACC AAAAGAGGGT 
GTAGTTGCTG CTACGAATAT GGTGGTGGCG T7CCCAGTTT ACGAAATGAT CAAGCAAACA 
"ATGGTTCCT ATAAATATGG AACAGAAGAA TTAGCGGTTG TTC ATATTTA TCCTAAAAAT 
GTGGTAGCCA ATGATGGTAG TTTACATGTG AAAAAAGTAG GAACTGCTGA AAATGAAGGA 
TTAAATGGCG CAGAATTTGT T ATTTC T AAA AGCGAAGGCT CACCAGGCAC AGTAAAATAT 
A^C^AAGGAG TCAAAGATGG ATTATATACA TGGACAACGG ATAAAGAACA AGCAAAACGC 
T-TATTACTG GGAAAAGTTA TGAAATTGGC G AAAATG ATT TCACAGAAGC AGAGAATGGA 
AC GGGAGAAT TAACAGTTAA AAATC TTG AG GTTGGTTCGT ATATTTTAGA AGAAGTAAAA 
GCTCCAAATA ATGCAGAATT AATTGAAAAT CAAACAAAAA CACCATTTAC AATTGAAGCA 
AACAATCAAA CACCTGTTGA AAAAACAGTC AAAAATGATA CCTCTAAAGT TGATAAAACA 
ACACCAAGCT TAGATGGTAA AGATGTGGCA ATTGGCGAAA AAATTAAATA TCAAATTTCT 
GTAAATATTC CATTGGGGAT TGCAGACAAA GAAGGCGACG CTAATAAATA CGTCAAATTC 
AATTTAGTTG ATAAACATGA TGCAGCCTTA ACTTTTGATA ACGTGACTTC TGG AG AG TAT 
GC-TATGCGT TATATGATGG GGATACAGTG ATTGCTCCTG AAAATTATCA AGTGACTGAA 
CAAGCAAATG GCTTCACTGT CGCCGTTAAT CCAGCGTATA TTCCTACGCT AACACCAGGC 
GGCACACTAA AATTCGTTTA CTTTATGCAT TTAAATGAAA AAGCAGATCC TACGAAAGGC 

_ »^~- t r-T'v A rr G prr- &CCAAACACC ACCAACTGTT 

GAAGTTGTGA CAGGTGGGAA ACGTTTCATT AAAGTCGATG GCGATGTGAC AGCGACACAA 
GCCTTGGCGG GAGCTTCCTT TGTCGTCCGT GATCAAAACA GCGACACAGC AAATTATTTG 
AAAATCGATG AAACAACGAA AGCAGCAACT TGGGTGAAAA CAAAAGCTGA AGCAACTACT 
TTTACAACAA CGGCTGATGG ATTAGTTGAT ATCACAGGGC TTAAATACGG TACCTATTAT 
TTAGAAGAAA CTGTAGCTCC TGATGATTAT GTCTTGTTAA C AAATC GG AT TGAATTTGTG 
GTCAATGAAC AATCATATGG CACAACAGAA AACCTAGTTT CACCAGAAAA AGTACCAAAC 
AAACACAAAG GTACCTTACC TTCAACAGGT GGCAAAGGAA TCTACGTTTA CTTAGGAAGT 
GGCGCAGTCT TGCTACTTAT TGCAGGAGTC TACTTTGCTA GACGTAGAAA AGAAAATGCT 
TAA 

EF058-2 ( SEQ ID NO:218) 
MKQLKKVW YTVSTLLLIL PLFTSVLGTT 

TAFAEENGES AQLVIHKKKM TDLPDPLIQN SGKEMSEFDK YQGLADVTFS IYNVTNEFYE 
QRAAGASVDA AXQAVQSLTP GKPVAQGTTD AKGNVTVQLP KKQNGKDAVY TIKEEPKEGV 
VAATNMWAF PVYEMIKQTD GSYKYGTEEL AWHIYPKNV VANDGSLHVK KVGTAENEGL 
NGAEFVISKS EGSPGTVKYI QGVKDGLYTO TTDKEQAKRF ITGKSYEIGE NDFTEAENGT 
GELTVKNLEV GSYILEEVKA PNNAELIENQ TKTPFTIEAN NQTPVEKTVK NDTSKVDKTT 
PSLDGKDVAI GEKIKYQISV NIPLGIADKE GDANKYVKFN LVDKHDAALT FDNVTSGEYA 
YALYDGDTVI APENYQVTEQ ANGFTVAVNP AYIPTLTPGG TLKFVYFMHL NEKADPTKGF 
KNEANVDNGH TDDQTPPTVE WTGGKRFIK VDGDVTATQA LAGASFWRD QNSDTANYLK 
IDETTKAATW VKTKAEATTF TTTADGLVDI TGLKYGTYYL EETVAPDDYV LLTNRIEFW 
NEQSYGTTEN LVSPEKVPNK KKGTLPSTGG KGI YVYLGSG AVLLLIAGVY FARRRKENA 



EF058-3 { SEQ ID NO:219) 
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AGAAGA AAATGGGGAG AGCGCACAGC TCGTGATTCA CAAAAAGAAA 

ATGACGGATT TACCAGATCC GCTTATTCAA AATAGCGGGA AAGAAATGAG CGAGTTTGAT 
AAATATCAAG GACTGGCAGA TGTGACGTTT AGTATTTATA ACGTGACGAA CGAATTTTAC 
GAGCAACGAG CGGCAGGCGC AAGCGTTGAT GCAGCTAAAC AAGCTGTCCA AAGTTTAACT 
CCTGGGAAAC CTGTTGCTCA AGGAACCACC GATGCAAATG GGAATGTCAC TGTTCAGTTA 
CCTAAAAAAC AAAATGGTAA AGATGCAGTG TATACCATTA AAGAAGAACC AAAAGAGGGT 
GTAGTTGCTG CTACGAATAT GGTGGTGGCG TTCCCAGTTT ACGAAATGAT CAAGCAAACA 
GATGGTTCCT ATAAATATGG AACAGAAGAA TTAGCGGTTG TTCATATTTA TCCTAAAAAT 
GTGGTAGCCA ATGATGGTAG TTTACATGTG AAAAAAGTAG GAACTGCTGA AAATGAAGGA 
TTAAATGGCG CAGAATTTGT TATTTCTAAA AGCGAAGGCT CACCAGGCAC AGTAAAATAT 
ATCCAAGGAG TCAAAGATGG ATTATATACA TGGACAACGG ATAAAGAACA AGCAAAACGC 
TTTATTACTG GGAAAAGTTA TGAAATTGGC GAAAATGATT TCACAGAAGC AGAGAATGGA 
ACGGGAGAAT TAACAGTTAA AAATCTTGAG GTTGGTTCGT ATATTTTAGA AGAAGTAAAA 
GCTCCAAATA ATGCAGAATT AATTGAAAAT CAAACAAAAA CACCATTTAC AATTGAAGCA 
AACAATCAAA CACCTGTTGA AAAAACAGTC AAAAATGATA CCTCTAAAGT TGATAAAACA 
ACACCAAGCT TAGATGGTAA AGATGTGGCA ATTGGCGAAA AAATTAAATA TCAAATTTCT 
GTAAATATTC CATTGGGGAT TGCAGACAAA GAAGGCGACG C TAATAAAT A CGTCAAATTC 
AATTT AG TTG ATAAACATGA TGCAGCCTTA ACTTTTGATA ACGTGACTTC TGGAGAGTAT 
GCTTATGCGT TATATGATGG GGATACAGTG ATTGCTCCTG AAAATTATCA AGTGACTGAA 
CAAGCAAATG GCTTCACTGT CGCCGTTAAT CCAGCGTATA TTCCTACGCT AACACCAGGC 
GGCACACTAA AATTCGTTTA CTTTATGCAT T T AAATG AAA AAGCAGATCC TACGAAAGGC 
TTTAAAAATG AGGCGAATGT TGATAACGGT CATACCGACG ACCAAACACC ACCAACTGTT 
GAAGTTGTGA CAGGTGGGAA ACGTTTCATT AAAGTCGATG GCGATGTGAC AGCGACACAA 
GCCTTGGCGG GAGCTTCCTT TGTCGTCCGT GATCAAAACA GCGACACAGC AAATTATTTG 
AAAATCGATG AAACAACGAA AGCAGCAACT TGGGTGAAAA CAAAAGCTGA AGCAACTACT 
TTTACAACAA CGGCTGATGG A TT AG TTG AT ATCACAGGGC TTAAATACGG TACCTATTAT 
TTAGAAGAAA CTGTAGCTCC TGATGATTAT GTCTTGTTAA CAAATCGGAT TGAATTTGTG 
GTCAATGAAC AATCATATGG CACAACAGAA AACCTAGTTT CACCAGAAAA AGTACCAAAC 
AAACACAAAG GTACCTTACC T 

EF058-4 (SEQ ID NO:220) 

EENGES AQLVIHKKKM TDLPDPLIQN SGKEMSEFDK YQGLADVTFS IYNVTNEFYE 
QRAAGASVDA AKQAVQSLTP GKPVAQGTTD ANGNVTVQLP KKQNGKDAVY TIKEEPKEGV 
VAATNMWAF PVYEMIKQTD GSYKYGTEEL AWHIYPKNV VANDGSLHVK KVGTAENEGL 
NGAEFVISKS EGSPGTVKYI QGVKDGLYTW TTDKEQAKRF ITGKSYEIGE NDFTEAENGT 
GELTVKNLEV GSYILEEVKA PNNAELIENQ TKTPFTIEAN NQTPVEKTVK NDTSKVDKTT 
PSLDGKDVAI GEKIKYQISV NIPLGIADKE GDANKYVKFN LVDKHDAALT FDNVTSGEYA 
YALYDGDTVI APENYQVTEQ ANGFTVAVNP AYIPTLTPGG TLKFVYFMHL NEKADPTKGF 
KNEANVDNGH TDDQTPPTVE WTGGKRFIK VDGDVTATQA LAGASFWRD QNSDTANYLK 
IDETTKAATW VKTKAEATTF TTTADGLVD I TGLKYGTYYL EETVAPDDYV LLTNRIEFW 
NEQSYGTTEN LVSPEKVPNK HKGT 



EF059-1 (SEQ ID NO:221) 

TAGATTGGAA GAATGAAAAT GAAAAAAATG 
TTAGCAGGGG GAAGCAGTGT TTCTGCTTAT 
ACAACAGGGA GTGTTTTACC AGATGAACCG 
GAGCCAGAGC AACCAACAGA GCCAAGTACA 
ACCGAACCTA GTGAGCCTTC AAAACCGACG 
CCAACAGAGC CAACAACGCC AAGTAAGCCA 
GTACCAGAGC AACCAACAGA GCCAAGTGTA 



ATT ATT AT TG CCTTATTCAG TACAAGCCTT 
GCGCAAGAAT CAGAAGGAAA TCTTGGTGAA 
AATGTACCAA CTGACCCAAT AACGCCAAGT 
CCAGAGCAAC CATCGGAACC GTCAACACCA 
GATCCTTCGT TACCAGACGA ACCGAGCGTA 
GAGCAACCAA CAGAGCCAAC AACGCCAAGT 
CCAGAAAAAC CAGTAGAACC AAATAAACCA 
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ACCGAGCCAG AAAAGCCTGT G:CAGTTGTT CCT3AAAAAC CAGTTGTACC ACAACAACCA 
GAGCAACCAA CAGATGTG3T 3 5TAAAGCCA AAT3GAGAAA TTGCAACAGG AGAATCTACA 

:aacagccaa ctgttccaat tgaaacgaat aacctttcag aagtaacaca tgtcccaact 

3TGACGACAC C3ATTGAAAC AGCAAGCGGA GAAGCAATTG TCGCAGTGGA TAAGGGCGTT 
CCTTTAACAC AAACGCCTGA T3GATT AAAA CCGATTAAAA GTGAATATAA AGTATTACCA 
AGTGGCAATG TACAAGTGAA AAGTGCTGAC GGAAAAATGA AAGTACTTCC TTACACTGGT 
CAAAAAATGG GCATAATTGG GTCAATCGCT GGT3TATGTT T2ACTGTTTT ATCAGGAATC 
TTAATTTATA AAAAACGTAA AGTGTAG 

EF059-2 ( SEC ID NC:222) 

MKKMI I IALFSTSLL AGGESVSAYA QESEGXLGET TGSVLPEEPN VPTDPITPSE 
PEQPTEPSTP EQPSEPSTPT EPSEPSKPTD PSLPDEPSVP TEPTTPSKPE QPTEPTTPSV 
PEQPTEPSVP EKPVEPNKPT EPEKPVPWP EKPWPQ2?E QPTDVWKPN GEIATGESTQ 
QPTVPIETNN LSEVTHVPTV TTP I ETASGE AIVAVDKGVP LTQTADGLKP IKSEYKVLPS 
GNVQVKSADG KMKVLPYTGE KMGIIGS1AG VCLTVLS 3 I L I YKKRKV 

EF059-3 ( SEC ID NO:223) 

AGAAGGAAA TCTTGGTGAA 

ACAACAGGGA GTGTTTTACC AGATGAACCG AATGTACCAA CTGACCCAAT AACGCCAAGT 
GAGCCAGAGC AACCAACAGA GCCAAGTACA CCAGAGCAAC CATCGGAACC GTCAACACCA 
ACCGAACCTA GTGAGCCTTC AAAACCGACG GATCCTTCGT TACCAGACGA ACCGAGCGTA 
CCAACAGAGC CAACAACGCC AAGTAAGCCA GAGCAACCAA CAGAGCCAAC AACGCCAAGT 
GTACCAGAGC AACCAACAGA GCCAAGTGTA C C AG AAAAAC CAGTAGAACC AAATAAACCA 
ACCGAGCCAG AAAAGCCTGT GCCAGTTGTT CCTGAAAAAC CAGTTGTACC ACAACAACCA 
GAGCAACCAA CAGATGTGGT GGTAAAGCCA AATGGAGAAA TTGCAACAGG AGAATCTACA 
CAACAGCCAA CTGTTCCAAT TGAAACGAAT AACCTTTCAG AAGTAACACA TGTCCCAACT 
GTGACGACAC CGATTGAAAC AGCAAGCGGA GAAGCAATTG TCGCAGTGGA TAAGGGCGTT 
CCTTTAACAC AAACGGCTGA TGGATTAAAA CCGATTAAAA GTGAATATAA AGTATTACCA 
AGTGGCAATG TACAAGTGAA AAGTGCTGAC GGAAAAATGA AAGTAC 

EF059-4 ( SEQ ID NQ:224) 

EGNLGET TGSVLPDEPN VPTDPITPSE 

PEQPTEPSTP EQPSEPSTPT EPSEPSKPTD PSLPDEPSVP TEPTTPSKPE QPTEPTTPSV 
PEQPTEPSVP EKPVEPNKPT EPEKPVPWP EKPWPQQPE QPTDWVKPN GEIATGESTQ 
QPTVPIETNN LSEVTHVPTV TTP I ETASGE AIVAVDKGVP LTQTADGLKP IKSEYKVLPS 
GNVQVKSADG KMKV 

EF060-1 (SEQ ID NO:225) 

TGAAAAATAG ACAAGGAGCA CGCGATGATG ACAATGAAAA GTAAAGGGTC ACTTCTGGTG 
ACGTTGGGAA TACTTTTAAC CGTTGGCATT GCGAGTCTAA TTGTTTCTTC TGAGAGTTTT 
GCAGAAGAAG TAGGGCAAAC GAATATCGGT GTAACG r ITCT ATGGAGGAAA AGAGCCACTA 
AAAACGGAAG GTGTCATTAA GCCAATAGAG CAACCAGTCA CTGATAAAGA TAAAAAAACG 
TCACAACAAC AAGACAAAGT GAGCAGAAAA ACCACTGCTA AAACGAATCC GACTAATGCA 
CAGACGTCAT TACCAAGGAC AGGTGAACGA AATAGCACGT GGCTTTACAG CCTTGGTATT 
GCCTGTTTAC TCGTAGTACT AACAAGTTTC TATTATTTGA ATAAAAAAAG GAAAAAGGAA 
AAATAA 

EF060-2 (SEQ ID NO:226) 

MMT MKSKGSLLVT LGILLTVGIA SLIVSSESFA EEVGQTNIGV TFYGGKEPLK 
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TEGVIKPIEQ PVTDKDKKTS QQQDKVSRKT TAKTNPTNAQ TSLPRTGERN STWLYSLGIA 
CLLWLTSFY YLNKKRKKEK 

EF060-3 ( SEQ ID NO:227) 

AGAAGAAG TAGGGCAAAC GAATATCGGT GTAACGTTCT ATGGAGGAAA AGAGCCACTA 
AAAACGGAAG GTGTCATTAA GCCAATAGAG CAACCAGTCA CTGATAAAGA TAAAAAAACG 
TCACAACAAC AAGACAAAGT GAGCAGAAAA ACCACTGCTA AAACGAATCC GACTAATGCA 
CAGACGTCAT 

EF060-4 (SEQ ID NO:228) 
EEVGQTNIGV TFYGGKEPLK 

TEGVIKPIEQ PVTDKDKKTS QQQDKVSRKT TAKTNPTNAQ TS 
EF061-1 {SEQ ID NO:229) 

TAATGGAACG ACCGACAGAA GAAGATTTTG AACTTACAAA TTAAAATTAA AATGGAGGAA 
ATAATGATGA AAAAAATTCT TTTTGCTAGT TTATTTAGTG CCACACTACT ATTTGGGGGA 
AG TG AAATTT CTGCTTTTGC ACAAGAAATT ATCCCTGATG ATACTACGAC ACCGCCCATT 
GAAGTACCAA CAGAACCAAG TACACCAGAA AAGCCAACAG ATCCAACACC GCCAATTGAG 
CCACCTGTAG ACCCTGTAGA GCCACCTATT ACACCAACGG AGCCAACAGA ACCGACAGAG 
CCGACAACAC CAACAGAACC TACAACTCCT ACAGAGCCAA GTGAACCAGA ACAACCAACG 
GAGCCAAGTA AACCAGTAGA ACCTGAAAAA CCAGTTACAC CAAGCAAACC AGCAGAACCC 
GAAAAAACTG TGACACCAAC TAAACCAACA GAATCTGAAA AACCAGTACA ACCAGCAGAA 
CCAAGCAAGC CAATCGACGT TGTTGTAACG CCAACAGGGG AATTAAATCA CGCTGGAAAT 
GGTACACAAC AGCCAACAGT CCCTATTGAA ACAAGTAATT TGGCAGAAAT CACGCACGTG 
CCTAGTGTAA CAACACCTAT TACAACTACA GACGGAGAAA ACATTGTAGC TGTAGAAAAA 
GGTGTTCCAC TTACACAAAC AGCAGAAGGG TTAAAACCTA TTCAATCNAG TTACAAAGTA 
TTGCCTAGCG G AAATG T AG A AGTAAAAGGT AAGGACGGTA AAATGAAGGT TTTACCATAC 
ACAGGTGAAG AAATG AATAT CTTTTTATCT GCCGTAGCGG TATCTTGTCT GTAG 

EF061-2 (SEQ ID NO:230) 

MMKKILFASL FSATLLFGGS EISAFAQEII PDDTTTPPIE 

VPTEPSTPEK PTDPTPPIEP PVDPVEPPIT PTEPTEPTEP TTPTEPTTPT EPSEPEQPTE 
PSKPVEPEKP VTPSKPAEPE KTVTPTKPTE SEKPVQPAEP SKPIDVWTP TGELNHAGNG 
TQQPTVPIET SNLAEITHVP SVTTPITTTD GENIVAVEKG VPLTQTAEGL KPIQSSYKVL 
PSGNVEVKGK DGKMKVLPYT GEEMNIFLSA VAVSCL 

EF061-3 (SEQ ID NO:231) 

G AAATTT CTGCTTTTGC ACAAGAAATT ATCCCTGATG ATACTACGAC ACCGCCCATT 
GAAGTACCAA CAGAACCAAG TACACCAGAA AAGCCAACAG ATCCAACACC GCCAATTGAG 
CCACCTGTAG ACCCTGTAGA GCCACCTATT ACACCAACGG AGCCAACAGA ACCGACAGAG 
CCGACAACAC CAACAGAACC TACAACTCCT ACAGAGCCAA GTGAACCAGA ACAACCAACG 
GAGCCAAGTA AACCAGTAGA ACCTGAAAAA CCAGTTACAC CAAGCAAACC AGCAGAACCC 
GAAAAAACTG TGACACCAAC TAAACCAACA GAATCTGAAA AACCAGTACA ACCAGCAGAA 
CCAAGCAAGC CAATCGACGT TGTTGTAACG CCAACAGGGG AATTAAATCA CGCTGGAAAT 
GGTACACAAC AGCCAACAGT CCCTATTGAA ACAAGTAATT TGGCAGAAAT CACGCACGTG 
CCTAGTGTAA CAACACCTAT TACAACTACA GACGGAGAAA ACATTGTAGC TGTAGAAAAA 
GGTGTTCCAC TTACACAAAC AGCAGAAGGG TTAAAACCTA TTCAATCNAG TTACAAAGTA 
TTGCCTAGCG GAAATGTAGA AGTAAAAGGT AAGGACGGTA AAATGAAGGT TT 
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EFC61-4 (SEQ 



qe:i fdotttp?:e 

VPTEPSTPEK PTGPTFPIE? PVDPVEPPIT PTEFTr PTE? TTFTEPTTPT EPSEPEQPTE 
PSKPVEPEKP VTPSKFAEPE KTVTPTKPTF £EKFV;FAE? SEP ICVVVT? TGELNHAGNG 
TQQPTYPIET SNLAEITHYP S7TTPITTTC GENIYA7EEG VPLTQTAEGL KPIQSSYKVL 
? S G NYE VK G E DG KMK V 

EFQ62-1 ( SEQ ID NO: 23 3} 

TGATTCTTGA AGCAACAAAT GAAAGCAAAA AAACAATATA AGAGATATAA AGCTAAGAAT 
_ ACTGGGTAA C TGTCCGTAT TTTTTTTGTA AGTGTC-TTA "J GAGGCGTAGG ATTAGCTACT 
GATAATGTAC AAGCCGCGGA ATTAGATACG CAACCAGAAA GAACGACGGT TCAACCCAAT 
AACCCCGACC TG ZAGTCAGA AAAGG AAA C A CCTAAAACG3 GAGTATCTGA AGAAGGAACA 
GTACAAAAAG AGAGTAGTTC TGAAGCGACC AAAGTAGAA'J AAGTAGCGCC AGAAAATAAA 
GGTACTGAAC AAAGTTCAGC TAGCGCAAAT GATACGACAA ACGGGCAACA ACCAACAGTA 
GGAGCTGAAA AATCAGCACA AGAACAACCA GTAGTAAGC .: :TGAAACAAC CAATGAACCT 
CTAGGGCAGC CAACAGAAGT TGCACCAGCT GAAAATG AA 3 TGAATAAATC AACGTCCATT 
G C TAAAG AAT TTGAAACAGC A3ACGTTGAT AAAGCAGTTG ATGAAGTAAA AAAAGATCCA 
AACATTACCG TTGTTGAAAA ACCAGCAGAA GACTTAGGGA ACGTTTCTTC TAAAGATTTA 
GGTGCAAAAG AAAAAGAAGT AGACCAACTA C AAAAAGAA " AAGGGAAAAA GATTGCCCAA 
CAAGCAGCTG AATTAAAAGC CAAAAATGAA AAAATTGCCA AAGAAAATGC AGAAATTGCG 
GCAAAAAACA AAGCNGAAAA A G AG CG NT AN GANAAAGAAG TCGCNGAATA CAACAAGCAT 
AAGAACGAAA ACAGCTATGT CAATGAAGCG ATT AG TAAAA ACCTAGTGTT CGATCAATCT 
GTCGTGACGA AAGACACTAA AATTTCGTCG ATTAAAGGCG GAAAATTTAT CAAAGCAACT 
GATTTTAATA AAGTAAATGC A3GGGATTCA AAAGATAT"T TTACAAAATT ACGGAAAGAT 
ATGGGNGGGA AAGNTACTGG CAACTTCCAG AATTCCTTTG TAAAAGAGGC AAATC TTGGG 
TCTAATGGTG GGTATGCGGT TGTTTTAGAA AAAAA TAAAG CAGTGACAGT GACCTATACA 
GGACTAAACG CTAGTTATTT AGGACGTAAA ATTACAAAAG CAGAATTTGT TTATGAACTA 
CAATCCTCAC CAAGCCAAAG TGGAACGTTA AATGCAGTAT TTTCAAACGA TCCGATTATC 
ACNGCTTTTA TTGGTACAAA CAGAGTCAAT GGTAAGGATG TTAAAACACG CTTAACGATT 
AAGTTCTTTG ATGCGTCAGG TAAAGAAGTA CTACCAGATA AAGATAGTCC ATTTGCGTAT 
GCGCTGTCTT CTTTAAATTC AAGTTTAACG AATAAAGGTG GCCATGCGGA ATTTGTTTCT 
GATTTTGGGG CNAACAATGC GTTCAAATAC ATTAATGGNT CNTATGTGAA AAAACAAGCG 
GATGGAAAAT TTTACTCACC GGAAGATATT GACTATGGCA CAGGACCTTC TGGATTGAAA 
AATAGTGATT GGGACGCTGT AGGTCACAAG AATGCGTACT TTGGTTCAGG TGTAGGTCTA 
GGNAATGGNC GTATTTCCTT TTCTTTTGGT ATGACAACAA AAGGAAAAAG TAATGTGCCT 
GTATCTAGTG CGCAATGGTT TGCCTTTAGN ACTAACTTAA ATGCGCAATC AGTGAAGCCT 
ATTTTCAATT ATGGGAATCC AAAAGAACCA GAAAAAGCAA CGATTGAATT CAATNGATAC 
AAAGCCAATG TCGTTCCTGT NCTTGTGCCN AATAAAGAAG TCACTGATGG NCAGAAAAAT 
NTCAATGATT TAAATGTGAA NCGTGGCGAT TCTTTACAAT ACATTGTGAG AGGGGATACG 
ACAGAACTTG GCAAAGTAGA TCCAAAAACA G T AA C N AAA C AAGGGATTCG AGATACNTTT 
GATGCAGAAA AAGTGACGAT TGATTTATCC AAAGTGAAAG TTTATCAAGC AGACGCAAGT 
CTNAACGANA AAGACTNAAA AGCTGTTGCT GCAGCNATTA ATTGAGGAAN AGCTAAAGAC 
GTGACTGCTT CTTATGANCT CAATTTAGAT CAAAA TACCG TCACAGCAAT GATGAAAACC 
AACGCNGACG GNTCNGTTGT TTTAGCAATG GGGTATAAAT ATTTACTTGT CTTGCCGTTT 
GTAGTGAAAA ATGTAGAAGG CGATTTTGAA AATACAGCTG TTCAGCTGAC AAANGATGGN 
GAAACGGTAA CAAATACAGT GATTAACCAT GTGCCAGGTA GTAATCCTTC CAAAGATGTA 
AAAGCAGATA AAAACGGTAC AGTTGGCAGT GTTTCTCTAC ATGATAAAGA TATTCCGTTA 
CAAACAAAAA TTTATTATGA AGTGAAATCT TCCGAACGTG CAGCNAACTA TGGCGGAATN 
ACNGAAGAAT GGGGCATGAA TGATGTCTTG GACACGACCG ATGATCGTTT CACAGGNAAA 
TGGCACGCTA TTACNAANTA TGACCTTAAA GTAGG 3GANA AAACGTTAAA AGCAGGAACA 
GATATTTCTG GCTACATTGT TTTAGAAAAC AAAGACAATA AAGACTTGAC GTTTACNATG 
AATCAAGCAT TATTGGCNGC NTTAAATGAA GGAAGCAATA AAGTAGGCAA AGAAGCTTGG 
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TCTGTGTATC TGGAAGTCGA ACGGATNAAA ACAGGTGAZTG TAGAAAACAC GCAAACAGAA 
AACTACAACA AAGAGCTTGT NCGTTCTAAT ACNGTGGTGA CGCATACNCC TGATGATCCA 
AAACCAACCA AAGCCGTTCA TAACAAGAAA GGGGAAGANA TTAANCATGG AAAAGTNGCT 
CGTGGTGATG TTCTTTCTTA TGAAATGACN TGGGACTTAA AAGGGTACGA TAAAGACTTT 
GCCTTTGATA CAGTCGATCT TGCGACAGGC GTTTCTTTCT TCGATGATTA CGATGAAACG 
AANGTGACAC CAATCAAAGA CTTACTTCGT GTCAAAGATT CTAAAGGGGN AGACATTACG 
AACCAGTTCA CGATCTCNTG GGACGATGCC AAAGGCACGG TGACNATNTC TGCCAAAGAC 
CCACAAGCCT TTATTCTAGC GNATGGTGGG CAAGAATTGC GTGTAACNCT CCCTACAAAA 
GTCAAAGCCG ATGTTTCTGG NGATGTTTAT AATTCAG7GG AACAAAATAC ATTTGGNCAA 
CGAATTAAAA CCAATACNGT TGTCAACCAT ATTCCAAAAG TGAANCCTAA AAAAGACGTG 
GTTATTAAAG TNGGTGACAA ACAAAGTCAA AATGGNGTCA CAATCAAATT AGGGGAGAAN 
TTCTTCTATG AATTTACAAG TAGTGACATT CCTGCAGAAT ACGCTGGNGT TGTGGAAGAA 
TGGTCGATTA GCGATAAACT AGACGTCAAA CATGACAAAT TTAGTGGCCA ATGGTCTGTG 
TTTGCCAATT CTAATTTTGT TTTAGCAGAC GGAACCAAAG TGAATAAAGG GGACGACATT 
TCGAAACTAT TCACGATGAC CTTTGAACAA GGGGTAGTGA AAATCACGGC CAGTCAAGCC 
TTTTTNGATG CGATGAATCT AAAAGAAAAC AAAAAC 3TTG CACACTCATG GAAAGCGTTC 
ATTGGTGTAG AACGAATTGC GGCAGGAGAC GTTTACAACA CAATCGAAGA ATCTTTCAAC 
AATGAGAAGA TTAAAACNAA TACGGTAGTG ACNCATAl'GC CAGAAAAACC ACAAACNCCA 
CCAGAAAAAA CAGTGATTGT ACCACCAACA CCAAAAATAr CGCAAGCACC AGTAGAGCCA 
TTAGTGGTAG AAAAGGCAAG TGTNGTGCCA GAATTGCTGC AAACAGGCGA AAAACAAAAT 
GTCTTATTAA CGGTAGCTGG TAGTTTAGCC GCAATGCTT 2 GCTTAGCAGG CTTAGGCTTT 
AAACGTAGAA AAGAAACAAA ATAA 



EF062-2 (SEQ ID NO:234) 

MKAKK QYKTYKAKNH WVTVPILFLS VLGAVGLATD NYQAAELDTQ PETTTVQPNN 
PDLQSEKETP KTAVSEEATV QKDTTSQPTK VEEVAPE?:KG TEC'S SATPND TTNAQQPTVG 
AEKSAQEQPV VSPETTNEPL. GQPTEVAPAE NEVNKSTSIP KEFETPDVDK AVDEVKKDPN 
I TWEKPAED LGNVSSKDLA AKEKEVDQLQ KEQAKK I AQQ AAFLKAKNEK IAKENAEIAA 
KNKAEKERXX KEVAEYNKHK NENSYVNEAI SKNLVFCwSV VTKDTKISSI KGGKFIKATD 
FNKVNAGDSK DIFTKLRKDM GGKXTGNFQN SFVKEATjLGS NGGVAVLLEK NKPVTVTYTG 
LNASYLGRKI TKAEFVYELQ SSPSQSGTLN AVFSNCPIIT AFIGTNRVNG KDVKTRLTIK 
FFDASGKEVL PDKDSPFAYA LSSLNSSLTN KGGHAEFVSD FGANNAFKYI NGSYVKKQAD 
GKFYSPEDID YGTGPSGLKN SDWDAVGHKN AYFGSGVGLA NGRISFSFGM TTKGKSNVPV 
SSAQWFAFXT NLNAQSVKPI FNYGNPKEPE KATIEFNXYK ANWPVLVPN KEVTDGQKNX 
NDLNVXRGDS LQYIVTGDTT ELAKVDPKTV TKQGIRPTFD AEKVTIDLSK VKVYQADASL 
NXKDXKAVAA AINSGXAKDV TASYXLNLDQ NTVTAKHXTN ADGSWLAMG YKYLLVLPFV 
VKNVEGDFEN TAVQLTXDGE TVTNTVINHV PGSNPSKDVK ADKNGTVGSV SLHDKDIPLQ 
TKIYYEVKSS ERPANYGGXT EEWGMNDVLD TTHDRF7GKW HAITXYDLKV GXKTLKAGTD 
ISAYILLENK DNKDLTFTMN QALLAALNEG SNKVGKQAWS VYLEVERXKT GDVENTQTEN 
YNKELVRSNT WTHTPDDPK PTKAVHNKKG EXIXHGKVAR GDVLSYEMTW DLKGYDKDFA 
FDTVDLATGV SFFDDYDETX VTPIKDLLRV KDSKGXI ITN QFTISWDDAK GTVTXSAKDP 
QAFIbAXGGQ ELRVTLPTKV KADVSGDVYN SAEQNTFGQR IFTIJTWNHI PKVXPKKDW 
IKVGDKQSQN GATIKLGEXF FYEFTSSDIP AEY AG VY E EW S3SDKLDVKH DKFSGQWSVF 
ANSNFVLADG TKVNKGDD I S KLFTMTFEQG WKITA^AF XDAMNLKENK NVAHSWKAFI 
GVERIAAGDV YNTIEESFNN EKIKTNTWT HTPEKPvjTPP EKTVIVPPTP KTPQAPVEPL 
WEKASWPE LPQTGEKQNV LLTVAGSLAA MLGLAGLGFK RRKETK 



EF062-3 (SEQ ID NO:235) 

TGATTCTTGA AGCAACAAAT GAAAGCAAAA AAACAATATA AG A CAT ATAA AGCTAAGAAT 
CACTGGGTAA CTGTCCCTAT TCTTTTTCTA AGTGTGTTAG GAGCCGTAGG ATTAGCTACT 
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GATAATGTAC AAGCCGCGGA AT7AGATACG CAACCAGAAA CAACGACGGT TCAACCCAA7 
AACCCCGACC TGCAGTCAGA AAAGGAAACA CCTAAAACGG CAG'I ATCTGA AGAAGCAACA 
GTACAAAAAG ACACTACTT: 7CAACCGACC AAAGTAGAAG AAG7AGCGC- AGAAAATAAA 

ggtactgaa: aaagttcag: taccccaaat gataccacaa acg:gcaaca accaacagta 
."5gagct3aaa aatcagcac a agaacaacca gtag7aag":"" crg.-aacaa- caatgaacc7 

:TAGGGCAGC GAACAGAAGT 7GCACCAGCT GAAAA7GAAL' TG AATAAATC aacgtccatt 
CCTAAAGAAT TTGAAACACC AGACGTTGAT AAAGGAGTT'" : A T G A~ G T AAA AAAAGATCCA 
AACATTACCG TTGTTGAAAA ACCAGCAGAA GACTTAGGCA AC 37TTCTTC TAAAGATTTA 
GCTGGAAAAG AAAAAGAAGT AGACCAACTA CAAAAAGAA/ AA G C G AAAAA GATTGCCCAA 
GAAGCAGCTG AATTAAAAGC CAAAAATGAA AAAATT^CCA AA3AAAATGC AGAAATTGCG 
GCAAAAAACA AAGCNGAAAA AGAGCGNTAN GATCAAA7AA - TC^CNGAATA CAACAAGCA7 
AAGAACGAAA ACAGCTATGT CAATGAAGCG ATTAGTAAAA AC:TAGTGTT CGATCAATCT 
3TCGTGACGA AAGACACTAA AATTTCGTCG ATT AAA GC C " GAAAATTTAT CAAAGCAACT 
GATTTTAATA AAGTAAATGC AGGGGATTCA AAAGATAT:" TTAGAAAATT ACGGAAAGAT 

atgggnggga aagntactgg caacttgcag aattccttt:; TAAAAGAGGC AAATCTTGGG 

TCTAATGGTG GGTATGCGGT TCTTTTAGAA AAAAA T AAA " CAGTGACAGT GACCTATACA 
GGACTAAACG CTAGTTATTT AGGACGTAAA ATTACAAAA3 C A 3 AATTTG T TTATGAACTA 
CAATCCTCAC CAAGCCAAAG TGGAACGTTA AATGCA 3TAT TTTTAAACGA TCCGATTATC 
ACKGCTTTTA TTGGTACAAA CAGAGTCAAT GGTAAG jAT.5 TTAAAACACG CTTAACGATT 
AAGTTCTTTG ATGCGTCAGG TAAAGAAGTA CTACCA3ATA AAGATAGTCC ATTTGCGTAT 
GCGCTGTCTT CTTTAAATTC AAGTTTAACG AATAAA 3GT GCCATGCGGA ATTTGTTTCT 
GATTTTGGGG CNAACAATGC GTTCAAATAC ATTAAT3GN7 CKTATGTGAA AAAACAAGCG 
GATGGAAAAT TTTACTCACC GGAAGATATT GACTATGGZA CAGGACCTTC TGGATTGAAA 
AATAGTGATT GGGACGCTGT AGGTCACAAG AATGCCTACT TTGGTTCAGG TX5TAGGTCTA 
GCNAATGGNC GTATTTCCTT TTCTTTTGGT ATGACAACAA AAG 3AAAAAG TAATGTGCCT 
GTATCTAGTG CGCAATGGTT TGCCTTTAGN ACTAACTTAA ATG 7GCAATC AGTGAAGCCT 
ATTTTCAATT ATGGGAATCC AAAAGAAC C A G AAAAA G Z AA CGATTGAATT CAATNGATAC 
AAAGCCAATG TCGTTCCTGT NCTTGTGCCN AATAAAGAA3 TCACTGATGG NCAGAAAAAT 
NTCAATGATT TAAATGTGAA NCGTGGCGAT TCTTTACAAT ACATTGTGAC AGGGGATACG 
ACAGAACTTG CCAAAGTAGA TCCAAAAACA GTAACNAAA- AAGGGATTCG AGATACNTTT 
GATGCAGAAA AAGTGACGAT TGATTTATCC AAAGTGAAAS TTTATCAAGC AGACGCAAGT 
CTNAACGANA AAGACTNAAA AGCTGTTGCT GCAGCIJATTA ATT 3 AGGAAN AGCTAAAGAC 
GTGACTGCTT CTTATGANCT CAATTTAGAT CAAAACACCG TCACAGCAAT GATGAAAACC 
AACGCNGACG GNTCNGTTGT TTTAGCAATG GGGTATAAAT ATTTACTTGT CTTGCCGTTT 
GTAGTGAAAA ATGTAGAAGG CGATTTTGAA AATACACCTG TTCAGCTGAC AAANGATGGN 
GAAACGGTAA CAAATACAGT GATTAACCAT GTGCCAGGTA GTAATCCTTC CAAAGATGTA 
AAAGCAGATA AAAACGGTAC AGTTGGCAGT GTTTCTCTAC ATGATAAAGA TATTCCGTTA 
CAAACAAAAA TTTATTATGA AGTGAAATCT TCCGAACGTC CAGCNAACTA TGGCGGAATN 
ACNGAAGAAT GGGGCATGAA TGATGTCTTG GACACGACCC ATGATCGTTT CACAGGNAAA 
TGGCACGCTA TTACNAANTA TGACCTTAAA GTAGGGGAIIA AAA C G TT AAA AGCAGGAACA 
GATATTTCTG CCTACATTCT TTTAGAAAAC AAAGAC A/\TA AAGACTTGAC GTTTACNATG 
AATCAAGCAT TATTGGCNGC NTTAAATGAA GGAAGTAAIA AA3TAGGCAA ACAAGCTTGG 
TCTGTGTATC TGGAAGTCGA ACGGATNAAA ACAGGTGACG TAGAAAACAC GCAAACAGAA 
AACTACAACA AAGAGCTTGT NCGTTCTAAT ACNGTGGTGA CGCATACNCC TGATGATCCA 
AAACCAACCA AAGCCGTTCA TAACAAGAAA GGGG AAG AI j A TTAANCATGG AAAAGTNGCT 
CGTGGTGATG TTCTTTCTTA TGAAATGACN TGGGACTTAA AAGGGTACGA TAAAGACTTT 
GCCTTTGATA CAGTCGATCT TGCGACAGGC GTTTCTTTCT TCGATGATTA CGATGAAACG 
AANGTGACAC CAATCAAAGA CTTACTTCGT GTCAAAGATT CTAAAGGGGN AGACATTACG 
AACCAGTTCA CGATCTCNTG G G AC G ATG CC AAAGGCACGG TGACNATNTC TGCCAAAGAC 
CCACAAGCCT TTATTCTAGC GNATGGTGGG CAAGAATTG Z GTGTAACNCT CC CTACAAAA 
GTCAAAGCCG ATGTTTCTGG NGATGTTTAT AATTCAGCGG AA ' Z AAAAT A C ATTTGGNCAA 
CGAATTAAAA CCAATACNGT TGTCAACCAT ATTCCAAAAG TGAANCCTAA AAAAGACGTG 
GTTATTAAAG TNGGTGACAA ACAAAGTCAA AATGGNGCZA CAATCAAATT AGGGGAGAAN 
TTCTTCTATG AATTTACAAG TAGTGACATT CCTGCAG AAT AC 3CTGGNGT TGTGGAAGAA 
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TGGTCGATTA GCGATAAACT AGACGTCAAA CATGACAAAT TTAGTGGCCA ATGGTCTGTG 

TTTGCCAATT CTAATTTTGT TTTAGCAGAC GGAACCAAAG TGAATAAAGG GGACGACATT 

TCGAAACTAT TCACGATGAC CTTTGAACAA GGGGTAC TGA AAATCACGGC CAGTCAAGCC 

TTTTTNGATG CGATGAATCT AAAAGAAAAC AAAAACGTTG CACACTCATG GAAAGCGTTC 

ATTGGTGTAG AACGAATTGC GGCAGGAGAC GTTTACAACA CAATCGAAGA ATCTTTCAAC 

AATGAGAAGA TTAAAACNAA TACGGTAGTG ACNCATACGC CAGAAAAACC ACAAACNCCA 

CCAGAAAAAA CAGTGATTGT ACCACCAACA CCAAAAACAC CGCAAGCACC AGTAGAGCCA 
TTAGTGGTAG AAAAGGCAAG TG 



EF062-4 <SEQ ID NO:236) 
AELDTQ PETTTVQPNN 

PDLQSEKETP KTAVSEEATV QKDTTSQPTK VEEVAPENKG TE^SATPND TTNAQQPTVG 
AEKSAQEQPV VSPETTNEPL GQPTEVAPAE NEVNKS7S IP KEFETPDVDK AVDEVKKDPN 
ITWEKPAED LGNVSSKDLA AKEKEVDQLQ KEQAKKIAQ^ AAELFJVKNEK IAKENAEIAA 
KNKAEKERXX KEVAEYNKHK NENSYVNEAI SKNLVFE~'SV VTKDTKISSI KGGKFIKATD 
FNKVNAGDSK DIFTKLRKDM GGKXTGNFQN SFVKEANL.GS NGGVAVLLEK NKPVTVTYTG 
LNASYLGRKI TKAEFVYELQ SSPSQSGTLN AVFSNDPIIT AFIGTNRVNG KDVKTRLTIK 
FFDASGKEVL PDKDSPFAYA LSSLNSSLTN KGGHAEFVSD FGAI INAFKYI NGSYVXKQAD 
GKFYSPEDID YGTGPSGLKN SDWDAVGHKN AYFGSGVGLA NGRISFSFGM TTKGKSNVPV 
SSAQWFAFXT NLNAQSVKPI FNYGNPKEPE KATIEFi:XYK ANWPVLVPN KEVTDGQKNX 
NDLNVXRGDS LQYIVTGDTT ELAKVDPKTV TKQGIRHTFD AEKVTIDLSK VKVYQADASL 
NXKDXKAVAA AINSGXAKDV TASYXLNLDQ NTVTAMMKTII ADGSWLAMG YKYLLVLPFV 
VKNVEGDFEN TAVQLTXDGE TVTNTVINHV PGSNPSFDVK ADKNGTVGSV SLHDKDIPLQ 
TKIYYEVKSS ERPANYGGXT EEWGMNDVLD TTKDRFTGKW HAITXYDLKV GXKTLKAGTD 
I SAYILLENK DNKDLTFTMN QALLAALNEG SNKVGKfrAWS VYLEVERXKT GDVENTQTEN 
YNKELVRSNT WTHTPDDPK PTKAVHNKKG EXIXHGKVAP GDVLSYEMTW DLKGYDKDFA 
FDTVDLATGV SFFDDYDETX VTPIKDLLRV KDSKGXDITIi QFTISWDDAK GTVTXSAKDP 
QAFILAXGGQ ELRVTLPTKV KADVSGDVYN SAEQNTFGQF. IKTIJ7WNHI PKVXPKKDW 
IKVGDKQSQN GATIKLGEXF FYEFTSSDIP AEYAGWEEW SI SDKLDVKH DKFSGQWSVF 
ANSNFVLADG TKVNKGDDIS KLFTMTFEQG WKITASQAF XDAMIJLKENK NVAHSWKAFI 
GVERIAAGDV YNTIEESFNN EKIKTNTWT HTPEKPQ7PP EKTVIVPPTP KTPQAPVEPL 
WEKASV 

EF063-1 { SEQ ID NO:237) 

TGATTCTTGA AGCAACAAAT GAAAGCAAAA AAACAATATA AGACATATAA AGCTAAGAAT 
CACTGGGTAA CTGTCCCTAT TCTTTTTCTA AGTGTGTTAG GAGCCGTAGG ATTAGCTACT 
GATAATGTAC AAGCCGCGGA ATTAGATACG CAACCAGAAA CAACGACGGT TCAACCCAAT 
AACCCCGACC TGCAGTCAGA AAAGGAAACA CCTAAAACGG CAGTATCTGA AGAAGCAACA 
GTACAAAAAG ACACTACTTC TCAACCGACC AAAGTAGAAG AAGTAGCGCC AGAAAATAAA 
GGTACTGAAC AAAGTTCAGC TACCCCAAAT GATACCACAA ACGCGCAACA ACCAACAGTA 
GGAGCTGAAA AATCAGCACA AGAACAACCA GTAGTAAGC": CTGAAACAAC CAATGAACCT 
CTAGGGCAGC CAACAGAAGT TGCACCAGCT GAAAATGAAG TGAATAAATC AACGTCCATT 
CCTAAAGAAT TTGAAACACC AGACGTTGAT AAAGCAGTTG ATGAAGTAAA AAAAGATCCA 
AACATTACCG TTGTTGAAAA ACCAGCAGAA GACTTAGGCA ACGTTTCTTC TAAAGATTTA 
GCTGCAAAAG AAAAAGAAGT AGACCAACTA CAAAAAGAAC AAGCGAAAAA GATTGCCCAA 
CAAGCAGCTG AATTAAAAGC CAAAAATGAA AAAATTGCCA AAGAAAATGC AGAAATTGCG 
GCAAAAAACA AAGCNGAAAA AGAGCGNTAN GANAAAGAAG TCGCNGAATA CAACAAGCAT 
AAGAACGAAA ACAGCTATGT CAATGAAGCG ATTAGTAAAA ACCTAGTGTT CGATCAATCT 
GTCGTGACGA AAGACACTAA AATTTCGTCG ATTAAAGGCG GAAAATTTAT CAAAGCAACT 
GATTTTAATA AAGTAAATGC AGGGGATTCA AAAGATATCT TTACAAAATT ACGGAAAGAT 
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TABLE i . Nucleotide and Amino Acid Scqcuenccs (•: I: jaccuns Genes. 



A7GGGNGGGA AAGNTA3TG3 CAACTTCCAG AAT'a :?7T7 7L : TAAAAGAGGC AAATCTTGGG 

~~TAA m GGTG GG7ATGCGG7 7C7777AGAA AAAAATAG.A;: CAG7 GACAGT GACCTATACA 

GGACTAAACG GTAGT7ATTT AGGACGTAAA A77ACAAAA:: ~ A G AATTTG T TTATGAACTA 

CAATCCTCAC CAAGC 3 AAA 3 7GGAACGT7A AATGCAGTAT TT7 3AAACGA TCCGATTATC 

ACNGCTTTTA 7TGGTACAAA CAGAGTCAAT GGTAAGGAi: . -AAAACACG CTTAACGATT 

AAGTTC7TT3 A7GCG7GAGG TAAAGAAGTA CTACCAGAOA AAGA7AGTCC ATTTGCGTAT 

GCGCTGTCTT CTTTAAATTC AAGT'TTAACG AA7AAAGGTG G:; 3 A7GCGGA ATTTG7Ti^.i 

GATT7TGGGG CNAACAATGC G7TCAAATAC A77AA7G ^:i7 C::7A7GTGAA AAAACAAGCG 

gatggaaaat tttactcacc ggaagatatt gactatgg;:;, cagga^ttc TGGAT7GAAA 

AATAGTGATT GGGACGCTGT AGGTCACAAG AATGCCTAC7 77GG77 3AGG TGTAGGTCTA 
GCNAA7GGNC G7ATT7CCT7 TTCTTTTGGT A7GACAACAA AAGGAAAAAG TAATGTGCCT 
GTATC7AGTG CGCAATGGTT TGCCTTTAGN ACTAAC77AA A7C- 7 G 7 AATC AGTGAAGCCT 
ATTTTCAATT ATGGGAATCC AAAAGAACCA GAAAAAGCAA CGA77 3AATT CAATNGATAC 
AAAGCCAATG 7CGTTCCTG7 NCTTG7GCCN AATAAAGAAG 7CA::7 3ATGG NCAGAAAAAT 
NTCAATGATT 7AAATGTGAA N~GTGGCGAT 7CTTTACAA7 ACA7T3TGAC AGGGGATACG 
ACAGAACTTG CCAAAGTAGA TGCAAAAACA GTAACNAAAG AAG G G A TTC G AGATACNTTT 
GATGCAGAAA AAGTGACGAT TGATT7ATCC AAAG7GAAAG T77A7 3AAGC AGACGCAAGT 
CTNAACGANA AAGACTNAAA AGCTG7TGCT GCAGCI JA7TA ATTC AGGAAN AGCTAAAGAC 
GTGAC7GCTT CTTATGANCT CAATTTAGAT CAAAACA:::G T7ACAGCAAT GATGAAAACC 
AACGCNGACG GNTCNGTTGT TTTAGCAATG GGG7A7AAA7 A77TAGTTGT CTTGCCGTTT 
GTAGTGAAAA ATGTAGAAGG CGATTTTGAA AATACAG Z T 3 77CAGGTGAC AAANGATGGN 
GAAACGGTAA CAAATACAGT GATTAACCAT GTGCCAGGTA GTAATGCTTC CAAAGATGTA 
AAAGCAGATA AAAACGGTAC AGTTGGCAGT GTTTCTCTAC ATGA1 AAAGA TATTCCGTTA 
CAAACAAAAA TTTATTATGA AGTGAAATCT TCCGAACGTC CAGCHAACTA TGGCGGAATN 
ACNGAAGAAT GGGGCATGAA TGATGTCTTG GACACGA™ ATGA7 3GTTT CACAGGNAAA 
7GGCACGCTA TTACKAANTA TGACCTTAAA GTAGGGGAN'A AAA C: 3 7TAAA AGCAGGAACA 
GATATTTCTG CCTACATTCT TTT AG AAAA C AAAGACAATA AAGACTTGAC GTTTACNATG 
AATCAAGCAT TATTGGCNGC NTTAAATGAA GGAAGGAAT A AAGTAGGCAA ACAAGCTTGG 
TCTGTGTATC 7GGAAGTCGA ACGGATNAAA ACAGGTGACG 7 A 1 3 A AAA C A C GCAAACAGAA 
AACTACAACA AAGAGCTTGT NCGTTCTAAT ACNGTGG7GA CG"A7ACNCC TGATGATCCA 
AAACCAACCA AAGCCGTTCA TAACAAGAAA GGGGAAG^KA TTAAM^ATGG AAAAGTNGCT 
CGTGGTGATG TTCTTTCTTA TGAAATGACN TG GG A C T T AA AAGGGTACGA TAAAGACTTT 
GCCTTTGATA CAGTCGATCT TGCGACAGGC G777CTTT7 7 TCGA7GATTA CGATGAAACG 
AANGTGACAC CAATCAAAGA CTTACTTCGT GTCAAAGAT7 CTAAAGGGGN AGACATTACG 
AACCAGTTCA CGATCTCNTG GGACGATGCC AAAGGCACGG TGACI1ATNTC TGCCAAAGAC 
CCACAAGCCT TTATTCTAGC GNATGGTGGG CAAGAA7TCC GTGTAACNCT CCCTACAAAA 
GTCAAAGCCG ATGTTTCTGG NGATGTTTAT AATTCAGCGG AACAAAATAC ATTTGGNCAA 
CGAATTAAAA CCAATACNGT TGTCAACCAT ATTCCAAAAG TGAAI1CCTAA AAAAGACGTG 
GTTATTAAAG TNGGTGACAA ACAAAGTCAA AATGGNGCCA CAATCAAATT AGGGGAGAAN 
TTCTTCTATG AATTTACAAG TAGTGACATT CCTGCAGAA7 ACGCTGGNGT TGTGGAAGAA 
TGGTCGATTA GCGATAAACT AGACGTCAAA CATGACAAA7 TTAG73GCCA ATGGTCTGTG 
TTTGCCAATT CTAATTTTGT TTTAGCAGAC GGAACCAAAG TGAATAAAGG GGACGACATT 
TCGAAACTAT TCACGATGAC CTTTGAACAA GGGGTAGTGA AAATGACGGC CAGTCAAGCC 
TTTTTNGATG CGATGAATCT AAAAGAAAAC AAAAACG7TG CACA 77CATG GAAAGCGTTC 
ATTGGTGTAG AACGAATTGC GGCAGGAGAC GTTTACAAJA CAATCGAAGA ATCTTTCAAC 
AATGAGAAGA TTAAAACNAA TACGGTAGTG ACNCATACGC CAGAAAAACC ACAAACNCCA 
CCAGAAAAAA CAGTGATTGT ACCACCAACA CCAAAAACAC CGCAAGCACC AGTAGAGCCA 
TTAGTGGTAG AAAAGGCAAG TGTNGTGCCA GAATTGGG 37 AAACAGGCGA AAAACAAAAT 
GTCTTATTAA CGGTAGCTGG TAGTTTAGCC GCAATGC7T3 GCT7AGCAGG CTTAGGCTTT 
AAACGTAGAA AAGAAACAAA ATAA 



EF063-2 (SEQ ID NO:238) 
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TABLE 1. Nucleotide and Ammo Acid Seqeuences of E. fact a I is Genes. 

MKAKK QYKTYKAKNH WVTVPILFLS VLGAVGLATF ^OAAELE 7Q PETTTVQPNN 
PDLQSEKETP KTAVSEEATV QKDTTSQPTK VEEVAPE^r" ? 7E .' 3SATPND TTNAQQPTVG 
AEKSAQEQPV VSPETTNEPL GQPTEVAPAE NEW ST..: j P KEr E7PDVDK AVDEVKKDPN 
ITWEKPAED LGNVSSKDLA AKEKEVDQLC 1 KEQAKK1 A> AA7LKAKNEK IAKENAEIAA 
KNKAEKERXX KEVAEYNKHK NENSYVNEAI SKNLYFDC^'V VTITTKISSI KGGKFIKATD 
FNKVNAGDSK DIFTKLRKDM GGKXTGNFQM SFVKEAMLGS HGGVAVLLEK NKPVTVTYTG 
LNASYLGRKI TKAEFVYELQ SSPSQSGTLIJ AVFSI1DP::T AFIGTNRVNG KDVKTRLTIK 
FFDASGKEVL PDKDSPFAYA LSSLNSSLTH KGGHAEFVST FGAMNAFKYI NGSYVKKQAD 
GKFYSPEDID YGTGPSGLKN SDWDAVGHKN AYFGSGVGLA I IGF I SFSFGM TTKGKSNVPV 
SSAQWFAFXT NLNAQSVKPI FNYGNPKEPE KATIEFKXYP'. ANVVPVLVPN KEVTDGQKNX 
NDLNVXRGDS LQYIVTGDTT ELAKVDPKTV TKQGIF.ETFf AEKV7IDLSK VKVYQADASL 
NXKDXKAVAA AINSGXAKDV TASYXLNLDQ N7VTAMKK7t: ADIASWLAMG YKYLLVLPFV 
VKNVEGDFEN TAVQLTXDGE TVTNTVINHV PGSNPSKEYF ADKNGTVGSV SLHDKDIPLQ 
TKIYYEVKSS ERPANYGGXT EEWGMNDVLU TTHDRFTGFW KAI7XYDLKV GXKTLKAGTD 
ISAYILLENK DNKDLTFTMN QALLAALNEG SNKVGKCAWS VYL.ZVERXKT GDVENTQTEN 
YNKELVRSNT WTHTPDDPK PTKAVKNKKG EXIXHGKYAR GEYLSYEMTW DLKGYDKDFA 
FDTVDLATGV SFFDDYDETX VTPIKDLLRV KDSKGXIJITH rFYISWDDAK GTVTXSAKDP 
QAFILAXGGQ ELRVTLPTKV KADVSGDVYN SAEQH7F~;F IKTI3TWNHI PKVXPKKDW 
IKVGDKQSQN GATIKLGEXF FYEFTSSDIP AEYAGWI.FW ^ 1 .'"-DKLDVKH DKFSGQWSVF 
ANSNFVLADG TKVNKGDDIS KLFTMTFEQG WKITASOAF XE'AUNLKENK NVAHSWKAFI 
GVERIAAGDV YNTIEESFNN EKIKTNTWT HTPEKP^IY? EK7YIVPPTP KTPQAPVEPL 
WEKASWPE LPQTGEKQNV LLTVAGSLAA MLGLAGLGr:' FPKETK 



EF063-3 (SEQ ID NO:239) 

GGA ATTAGATACG CAACCAGAAA CAACGACGGT TCAAC .7CAA7 

AACCCCGACC TGCAGTCAGA AAAGGAAACA CCTAAAAC G G CAGTATCTGA AGAAGCAACA 
GTACAAAAAG ACACTACTTC 7CAACCGACC AAAGTAG AAG AAG7AGCGCC AGAAAATAAA 
GGTACTGAAC AAAGTTCAGC TACCCCAAAT GATACCACAA ACG-GCAACA ACCAACAGTA 
GG AG CTG AAA AATCAGCACA AGAACAACCA GTAGTAAGG:: CTGAAACAAC CAATGAACCT 
CTAGGGCAGC CAACAGAAGT TGCACCAGC7 GAAAATGAAG TGAATAAATC AACGTCCATT 
CCTAAAGAAT TTGAAACACC AGACGTTGA7 AAAGCACT7G ATGAAGTAAA AAAAGATCCA 
AACATTACCG TTGTTGAAAA ACCAGCAGAA GACTTACGCA ACGTTTCTTC TAAAGATTTA 
GCTGCAAAAG AAAAAGAAGT AGACCAACTA CAAAAAGAAC AAGCGAAAAA GATTGCCCAA 
CAAGCAGCTG AATTAAAAGC CAAAAATGAA AAAATTG : -A AAGAAAATGC AGAAATTGCG 
GCAAAAAACA AAGCNGAAAA AGAGCGNTAN GANAAAGAA3 TCGCNGAATA CAACAAGCAT 
AAGAACGAAA ACAGCTATGT CAATGAAGCG ATTAGTAAAA AG C 7AGTGTT CGATCAATCT 
GTCGTGACGA AAGACACTAA AATTTCGTCG ATTAAAGG73 GAAAATTTAT CAAAGCAACT 
GATTTTAATA AAG7AAATGC AGGGGATTCA AAAGATA77T T7ACAAAATT ACGGAAAGAT 
ATGGGNGGGA AAGNT AC TGG CAACTTCCAG AAT7CCT77G TAAAAGAGGC AAATC TTGGG 
TCTAATGGTG GGTATGCGGT TCTTTTAGAA AAAAATAAA : CAGTGACAGT GACCTATACA 
GGACTAAACG CTAGTTATTT AGGACGTAAA ATTACAAAAG CAGAATTTGT T7ATGAACTA 
CAATCCTCAC CAAGCCAAAG TGGAACGTTA AATGCAG7AT TTTCAAACGA TCCGATTATC 
ACNGCTTTTA TTGGTACAAA CAGAGTCAAT GGTAAGGATG TTAAAACACG CTTAACGATT 
AAGTTCTTTG ATGCGTCAGG TAAAGAAG7A CTACCAGA7A AA3ATAGTCC ATTTGCGTAT 
GCGCTGTCT7 CTTTAAATTC AAGTTTAACG AATAAAGGTG GGCATGCGGA ATTTGTTTCT 
GATTTTGGGG CNAACAATGC GTTCAAATAC ATTAATGGNT CUTATGTGAA AAAACAAGCG 
GATGGAAAAT TTTACTCACC GGAAGATATT GACTATGGCA CAGGACCTTC TGGATTGAAA 
AATAGTGATT GGGACGCTGT AGGTCACAAG AATGCCTACT TTGGTTCAGG TGTAGGTCTA 
GCNAATGGNC GTATTTCCTT TTCTTTTGGT ATGACAA :AA A A G G AAAAAG TAATGTGCCT 
GTATCTAGTG CGCAATGGTT TGCCTTTAGN ACTAACT7AA A7G:GCAATC AGTGAAGCCT 
ATTT7CAAT7 ATGGGAATCC AAAAGAACCA G AAAAAG i. AA CGATTGAATT CAATNGATAC 
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TABLE 1. Nucleotide and Ammo Acid Seqeuenccs 



ill \;iis Genes. 



AAAGCCAATG TCGTTCGTGT NCTTG7GC CN AAT. 

NTCAATGATT TAAAT3TGAA N*:GTGGCGAT * CTTTA 1 

ACAGAACTTG CC AAA 37 AG A TCCAAAAACA GTAJVCLV 

3AT3 "AG AAA AAGTGACGAT TGATTTATCC AAAGTG 



;a;:tgatgg ncagaaaaat 
:attgtgac aggggatacg 
,:;ggattcg agatacnttt 



ELE'TQ ?ETT r 
PDLC/SEKETP 

aefsac.eqpv 
ityvekpaed 
knkaekerxx 
fn*' vnagdsk 

LNAfTYLGRK I 
FFDASGKEVL 
GKFYSPED1D 
SSAQWFAFXT 
NDLNVXRGDS 



^VQPNK 
KTAVSEEA7V 
VSPETTNEPL 
LGMVSSKDLA 
KEVAEYNKHK 
DIFTKLRKDM 
TKAEFWELC 
PDKDSPFAYA 
YGTGPSGLKN 

nlnaqsvkp: 

LQYIVTGDTT 



QK DTTSCPTX 

g£ ptevapae 
akekevd*'L£ 

KENSYVNEAI 
GGKXTGNFQN 
S^PSC'SGTLri 
LSSLN'SSLTN 
SDWDAVGHKI3 
FNYGNPKEPE 
ELAKVDPKTV 



KEVNKST 

keqakki 

SKNLVFD 

sfvkea:; 

AYFSNDP 
KGGKAEF 
AYFGSGV 
KATIEFN 
TKQGIED 



ST£ II 



te: ssatpnd ttnaqqptvg 
kefetpdvdk avdevkkdpn 
aaelkaknek laken'aeiaa 
ytkdtxissi kggkfikatd 
n3gyavllek nkpvtvtytg 
aftgtnrvng kdvktrltik 
fgan:;afkyi ngsyvkkqad 
n3risfsfgm ttkgksnvpv 
anvvpvlvpn kevtdgqknx 
ae?" yt i i3lsk v 



EF064-1 (SEQ ID NO:241) 



TGATTCTTGA 
CACTGGGTAA 
GATAATGTAC 
AACCCCGACC 
GTACAAAAAG 
GGTACTGAAC 
GGAGCTGAAA 
CTAGGGCAGC 
CCTAAAGAAT 
AACATTACCG 
GCTGCAAAAG 
CAAGCAGCTG 
GCAAAAAACA 
AAGAACGAAA 
GTCGTGACGA 
GATTTTAATA 
ATGGGNGGGA 
TCTAATGGTG 
GGACTAAACG 
CAATCCTCAC 
ACNGCTTTTA 
AAGTTCTTTG 
GCGCTGTCTT 
GATTTTGGGG 
GATGGAAAAT 
AATAGTGATT 
GCNAATGGNC 
GTATCTAGTG 
ATTTTCAATT 
AAAGCCAATG 
NTCAATGATT 
ACAGAACTTG 



AGCAACAAAT 
CTGTCCCTAT 
AAGCCGCGGA 
TGCAGTCAGA 
ACACTACTTC 
AAAGTTCAGC 
AATCAGCACA 
CAACAGAAGT 
TTGAAACACC 
TTGTTGAAAA 
AAAAAGAAGT 
AATTAAAAGC 
AAGCNGAAAA 
ACAGCTATGT 
AAGACACTAA 
AAGTAAATGC 
AAGNTACTGG 
GGTATGCGGT 
CTAGTTATTT 
CAAGCCAAAG 
TTGGTACAAA 
ATGCGTCAGG 
CTTTAAATTC 
CNAACAATGC 
TTTACTCACC 
GGGACGCTGT 
GTATTTCCTT 
CGCAATGGTT 
ATGGGAATCC 
TCGTTCCTGT 
TAAATGTGAA 
CCAAAGTAGA 



GAAAGCAAAA 
TCTTTTTCTA 
ATTAGATACG 
AAAGGAAACA 
TCAACCGACC 
TACCCCAAAT 
AGAACAACCA 
TGCACCAGCT 
AGACGTTGAT 
ACCAGCAGAA 
AGACCAACTA 
CAAAAATGAA 
AGAGCGNTAN 
CAATGAAGCG 
AATTTCGTCG 
AGGGGATTCA 
CAACTTCCAG 
TCTTTTAGAA 
AGGACGTAAA 
TGGAACGTTA 
CAGAGTCAAT 
TAAAGAAGTA 
AAGTTTAACG 
GTTCAAATAC 
GGAAGATATT 
AGGTCACAAG 
TTCTTTTGGT 
TGCCTTTAGN 
AAAAGAACCA 
NCTTGTGCCN 
NCGTGGCGAT 
TCCAAAAACA 



AAA C AA TAT A 
AGTGTGT7AG 
CAACCAGAAA 
CCTAAAA1' GG 
AAAGTAGAAG 
GATACCA AAA 
GTAGTAA ~ : CC 
GAAAATGAAG 
AAAGCAGT7G 
GACTTAGGCA 
CAAAAAGAAC 
AAAATTGCCA 
GANAAAGAAG 
ATTAGTAAAA 
ATTAAAGGCG 
AAAGATATCT 
AATTCC7T7G 
AAAAATAAA I 
ATTACAAAAG 
AATGCAGTAT 
GGTAAGG AT G 
CTACCAGATA 
AATAAAGGTG 
ATTAAT3GNT 
GACTATGGCA 
AATGCCTACT 
ATGACAACAA 
ACTAACTTAA 
GAAAAA3CAA 
AATAAAGAAG 
TCTTTACAJYI 
GTAACNAAG-C 



AGACATATAA 
GAGCCGTAGG 
CAACGACGGT 
CAGTATCTGA 
AAG TAGCGCC 
ACGG jCAACA 
CTGAAACAAC 
TGAATAAATC 
ATGAAGTAAA 
ACGTTTCTTC 
AA G G G AAAAA 
AAGAAAATGC 
TC3CNGAATA 
ACCTAGTGTT 
GAAAATTTAT 
TTACAAAATT 
TA/vAAGAGGC 
CAGTGACAGT 
CAGAATTTGT 
TTTCAAACGA 

ttaaaacacg 
aagatagtcc 
gc gatgcgga 
g:;tatgtgaa 

CAGGACCTTC 
TTGGTTCAGG 
AAGGAAAAAG 
ATGCGCAATC 

cgattgaatt 
tcact3atgg 
acattgtgac 
aagggattcg 



agctaagaat 
attagctact 
tcaacccaat 
agaagcaaca 

AGAAAATAAA 

accaacagta 
caatgaacct 
aacgtccatt 
aaaagatcca 

TAAAGATTTA 
GATTGCCCAA 
AGAAATTGCG 
CAACAAGCAT 
CGATCAATCT 
CAAAGCAACT 
ACGGAAAGAT 
AAATCTTGGG 
GACCTATACA 
TTATGAACTA 
TCCGATTATC 
CTTAACGATT 
ATTTGCGTAT 
ATTTGTTTCT 
AAAACAAGCG 
TGGATTGAAA 
TGTAGGTCTA 
TAATGTGCCT 
AGTGAAGCCT 
CAATNGATAC 
NCAGAAAAAT 
AGGGGATACG 
AGATACNTTT 
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TABLE 1. Nucleotide and Amino Acid Scqcuenccs ol E.faccalis Genes. 

GATGCAGAAA AAGTGACGAT TGATTTATCC AAAGTGAAAG TTTATCAAGC AGACGCAAGT 

CTNAACGANA AAGACTNAAA AGCTGTTGCT GCAGCNATTA ATTCAGGAAN AGCTAAAGAC 

GTGACTGCTT CTTATGANCT CAATTTAGAT CAAAACA:-:G TCACAGCAAT GATGAAAACC 

AACGCNGACG GNTCNGTTGT TTTAGCAATG GGGTATAAAT ATTTAGTTGT CTTGCCGTTT 

GTAGTGAAAA ATGTAGAAGG CGATTTTGAA AATACAG:TG TTCAGCTGAC AAANGATGGN 

GAAACGGTAA CAAATACAGT GATTAACCAT GTGCCAGGTA GTAATCCTTC CAAAGATGTA 

AAAGCAGATA AAAACGGTAC AGTTGGCAGT GTTTCTCTAT ATC-ATAAAGA TATTCCGTTA 

CAAACAAAAA TTTATTATGA AGTGAAATCT TCCGAACGT? CAGCUAACTA TGGCGGAATN 

ACNGAAGAAT GGGGCATGAA TGATGTCTTG GACACGA::: ATGATCGTTT CACAGGNAAA 

TGGCACGCTA TTACNAANTA TGACCTTAAA GTAGGGGAX'A AAACGTTAAA AGCAGGAACA 

GATATTTCTG CCTACATTCT TTTAGAAAAC AAAGACAA r A AAGACTTGAC GTTTACNATG 

AATCAAGCAT TATTGGCNGC NTTAAATGAA GGAAGCAATA AAGTAGGCAA ACAAGCTTGG 

TCTGTGTATC TGGAAGTCGA ACGGATNAAA ACAGGTGACG TA 3AAAACAC GCAAACAGAA 

AACTACAACA AAGAGCTTGT NCGTTCTAAT ACNGTGGTGA CGTAPACNCC TGATGATCCA 

AAACCAACCA AAGCCJTTCA TAACAAGAAA GGGGAAGANA TTAAIICATGG AAAAGTNGCT 

CGTGGTGATG TTCTTTCTTA TGAAATGACN TGGGACT PAA AAGGGTACGA TAAAGACTTT 

GCCTTTGATA CAGTCGATCT TGCGACAGGC GTTTCTTTTT TCGATGATTA CGATGAAACG 

AANGTGACAC CAATCAAAGA CTTACTTCGT GTCAAAGATT CTAAAGGGGN AGACATTACG 

AACCAGTTCA CGATCTCNTG GGACGATGCC AAAGGCA ~GG TGACNATNTC TGCCAAAGAC 

CCACAAGCCT TTATTCTAGC GNATGGTGGG CAAGAATTG: GTGTAACNCT CCCTACAAAA 

GTCAAAGCCG ATGTTTCTGG NGATGTTTAT AATTCAGO^G AACAAAATAC ATTTGGNCAA 

CGAATTAAAA CCAATACNGT TGTCAACCAT ATTCCAAAAG TGAANCCTAA AAAAGACGTG 

GTTATTAAAG TNGGTGACAA ACAAAGTCAA AATGGNG 1'A CAATCAAATT AGGGGAGAAN 

TTCTTCTATG AATTTACAAG TAGTGACATT CCTGCAGAAT AC 3 -TGGNGT TGTGGAAGAA 

TGGTCGATTA GCGATAAACT AGACGTCAAA CATGACAAAT TTAGTGGCCA ATGGTCTGTG 

TTTGCCAATT CTAATTTTGT TTTAGCAGAC GGAACCAAA:- TGAATAAAGG GGACGACATT 

TCGAAACTAT TCACGATGAC CTTTGAACAA GGGGTAGTGA AAATCACGGC CAGTCAAGCC 

TTTTTNGATG CGATGAATCT AAAAGAAAAC AAAAACGTTG CATAOTCATG GAAAGCGTTC 

ATTGGTGTAG AACGAATTGC GGCAGGAGAC GTTTACAACA CAATCGAAGA ATCTTTCAAC 

AATGAGAAGA TTAAAACNAA TACGGTAGTG ACNCATA:g: CAGAAAAACC ACAAACNCCA 

CCAGAAAAAA CAGTGATTGT ACCACCAACA CCAAAAAJAT CGTAAGCACC AGTAGAGCCA 

TTAGTGGTAG AAAAGGCAAG TGTNGTGCCA GAATTGC AAACAGGCGA AAAACAAAAT 

GTCTTATTAA CGGTAGCTGG TAGTTTAGCC GCAATGC TP? GCTTAGCAGG CTTAGGCTTT 
AAACGTAGAA AAGAAACAAA ATAA 



EF064-2 (SEQ ID NO:242) 

MKAKK QYKTYKAKNH WVTVPILFLS VLGAVGLATD r/QAAELDTQ PETTTVQPNN 
PDLQSEKETP KTAVSEEATV QKDTTSQPTK VEEVAPENKG TE^SSATPND TTNAQQPTVG 
AEKSAQEQPV VSPETTNEPL GQPTEVAPAE NEVNKST3IP EEFETPDVDK AVDEVKKDPN 
ITWEKPAED LGNVSSKDLA AKEKEVDQLQ KEQAKKIA2C.' AAELKAKNEK IAKENAEIAA 
KNKAEKERXX KEVAEYNKHK NENSYVNEAI SKNLVFL^SV VTKDTKISSI KGGKFIKATD 
FNKVNAGDSK DIFTKLRKDM GGKXTGNFQN SFVKEAKL3E NGGYAVLLEK NKPVTVTYTG 
LNASYLGRKI TKAEFVYELQ SSPSQSGTLN AVFSNDP I IT AFI GTNRVNG KDVKTRLTIK 
FFDASGKEVL PDKDSPFAYA LSSLNSSLTN KGGHAEFVSD FGANUAFKYI NGSYVKKQAD 
GKFYSPEDID YGTGP5GLKN SDWDAVGHKN AYFGSGVGLA N3 RISFSFGM TTKGKSNVPV 
SSAQWFAFXT NLNAQSVKPI FNYGNPKEPE KATIEFKXYK ANWPVLVPN KEVTDGQKNX 
NDLNVXRGDS LQYIVTGDTT ELAKVDPKTV TKQGIRETFD AEKVTIDLSK VKVYQADASL 
NXKDXKAVAA AINSGXAKDV TASYXLNLDQ NTVTAMMKTi: ADGSWLAMG YKYLLVLPFV 
VKNVEGDFEN TAVQLTXDGE TVTNTVINHV PGSNPSKDVK ADKNGTVGSV SLHDKDIPLQ 
TKIYYEVKSS ERPANYGGXT EEWGMNDVLD TTHDRFT3KVJ HAITXYDLKV GXKTLKAGTD 
ISAYILLENK DNKDLTFTMN QALLAALNEG SNKVGKCAW? VYLEVERXKT GDVENTQTEN 
YNKELVRSNT WTHTPDDPK PTKAVHNKKG EXIXHGK.VAF. GDVLSYEMTW DLKGYDKDFA 
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TABLE ! Nucleotide and Amino Acid Sequences • : l. v„,.;.';> Genes. 

f v w d l a tg v s f f z z v de tx vv ? i Kin f v x i s k c- x r : v: : ~ f t : f v: c d ax g wtx s ak d p 

QAFILAX3GQ ELRVTLPTKV KADVSGEVYN G AEiVVF G .J, I X. WTWNK I PKVXPKKDW 

1KVG3KQSQN GATIKLGEXF FVEFTSSEIP AFYAGWXXX S ' FXK1CVKK DKFSGQWSVF 

ANSNFV1ADG TKVNKGOGIS KLFT^TFEQG WXITAS.AX X : . AM;:XKEXX ITVAHSWKAFI 

GVERIAACCV YNTIEESFN'K EKIKT^TWT XTPEKPC" ? EX TV IV? FTP KTPQAPVEPL 

WEKASWPE LPQTGEKQNV LLTVAGSLAA K1GLAGLCIX. FXKETX 

EF364-3 (SEQ ID NC:243) 

AGTGACGAT TGATTTATCC AAAGTGAAAG TTTATCAAGC AGACGC AAGT 
CTNAACGAUA AAGACTNAAA AGCTGTTGCT GCAGCXATVA AVTCAGGAAN AGGTAAAGAC 
GTGACTGCTT CTTATGANCT CAATTTAGAT CAAAACACV G TXACAGCAAT GATGAAAACC 
AACGCNGACG GNTCNGTTGT TTTAGCAATG GGGTATAAAT AVVTACTTGT CTTGCGGTTT 
GTAGTGAAAA ATGTAGAAGG CGATTTTGAA AATACAGC'.: G TTCAG C7GAC AAANGATGGN 
GAAACGGTAA CAAATACAGT GAT7AACCAT GTGGGAG3 r : A GTAATCCTTC CAAAGATGTA 
AAAG C AG AT A AAAACGGTAC AGTTGGCAGT GTTT-CTC VAC A7GATAAAGA TATTCCGTTA 
CAAACAAAAA TTTATTATGA AGTGAAATCT TCCGAACG7C CAGCNAACTA TGGCGGAATN 
ACNGAAGAAT GGGGCATGAA TGATGTCTTG GACACGACGC AVGATCGTTT CACAGGNAAA 
TGGCACGCTA TTACNAANTA TGACCTTAAA GTAGGGGA: :A AAACGTTAAA AGCAGGAACA 
GATATTTCTG CCTACATTCT TTTAGAAAAC AAAG A C A A V A AAGAGTTGAC GTTTACNATG 
AATCAAGCAT TATTGGCNGC NTTAAATGAA GGAAGCAAVA AAGTAGGCAA ACAAGCTTGG 
TCTGTGTATC TGGAAGTCGA ACGGATNAAA ACAGGTGAGG TAGAAAACAC GCAAACAGAA 
AACTACAACA AAGAGCTTGT NCGTTCTAAT ACNGTGGTGA CGCATACNCC TGATGATCCA 
AAACCAACCA AAGCCGTTCA TAACAAGAAA GGGGAAGAVA TTAAVCATGG AAAAGTNGCT 
CGTGGTGATG TTCTTTCTTA TGAAATGACN TGGGACTTAA AAGGGTACGA TAAAGACTTT 
GCCTTTGATA CAGTCGATCT TGCGACAGGC GTTTCTTV "T T :GATGATTA CGATGAAACG 
AANGTGACAC CAATCAAAGA CTTACTTCGT GTC AAAG ATT CTAAAGGGGN AGACATTACG 
AACCAGTTCA CGATCTCNTG GGACGATGCC AAAG GC AG X- T-ACKATNTC TGCCAAAGAC 
CCACAAGCCT TTATTCTAGC GNATGGTGGG CAAGAATTGC GTGTAACNCT CCCTACAAAA 
GTCAAAGCCG ATGTTTCTGG NGATGTTTAT AATTCAGC 2G AACAAAATAC ATTTGGNCAA 
CGAATTAAAA CCAATACNGT TGTCAACCAT ATTCCAAAAG TGAAHCCTAA AAAAGACGTG 
GTT ATT AAAG TNGGTGACAA ACAAAGTCAA AATGGNGC :A CAATCAAATT AGGGGAGAAN 
TTCTTCTATG AATTTACAAG TAGTGACATT CCTGCAGAAT ACGCTGGNGT TGTGGAAGAA 
TGGTCGATTA GCGATAAACT AGACGTCAAA CATGACAAAT TTAGTGGCCA ATGGTCTGTG 
TTTGCCAATT CTAATTTTGT TTTAGCAGAC GGAACCAAAG TGAATAAAGG GGACGACATT 
TCGAAACTAT TCACGATGAC CTTTGAACAA GGGGTAGT 3A AAAT 1'ACGGC CAGTCAAGCC 
TTTTTNGATG CGATGAATCT AAAAGAAAAC AAAAACGTTG CACACTCATG GAAAGCGTTC 
ATTGGTGTAG AACGAATTGC GGCAGGAGAC GTTTACAA "A CAATCGAAGA ATCTTTCAAC 
AATGAGAAGA TTAAAACNAA TACGGTAGTG ACNCATAC 2C CAGAAAAACC ACAAACNCCA 
CCAGAAAAAA CAGTGATTGT ACCACCAACA CCAAAAAI AC CGCAAGCACC AGTAGAGCCA 
TTAGTGGTAG AAAAGGCAAG TGTNGTGCCA GAATTGCCGC AAACAGGCGA AAAACAAAAT 
GTCTTATTAA CGGTAGCTGG TAGTTTAGCC GCAATGCT7 3 GCTTAGCAGG CTTAGGCTTT 
AAACGTAGAA AAGAAACAAA ATAA 

EF064-4 (SEQ ID NO:244) 

VTIDLSK VKVYQADASL 

NXKDXKAVAA AINSGXAKDV TASYXLNLDQ NTVTAMMKTN AVGSWLAMG YKYLLVLPFV 

VKNVEGDFEN TAVQLTXDGE TVTNTVINHV PGSKPSKVVK AOKNGTVGSV SLHDKDIPLQ 

TKIYYEVKSS ERPANYGGXT EEWGMNDVLD TTHDRFTGKW KAITXYDLKV GXKTLKAGTD 

ISAYILLENK DNKDLTFTMN QALLAALNEG SNKVGKQAWS V Y1EVERXKT GDVENTQTEN 

YNKELVRSNT WTHTPDDPK PTKAVHNKKG EXIXHGK7AR GXV'LSYSMTW DLKGYDKDFA 

FDTVDLATGV SFFDDYDETX VTPIKDLLRV KDSKGXDITN QFTISWDDAK GTVTXSAKDP 
QAFILAXGGQ ELRVTLPTKV KADVSGDVYN SAEQNTFGQR IKTUTWNHI PKVXPKKDW 
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IKVGDKQSQN GATIKLGEXF FYEFTSSDIP AEYAGWEEiJ £ ISDKLDVKH DKFSGQWSVF 

ANSNFVLADG TKVNKGDDIS KLFTMTFEQG WKITASQAE XEAMNLKENK NVAHSWKAFI 

GVERIAAGDV YNTIEESFNN EKIKTNTWT HTPEKPQTP? EF.TVIVPPTP KTPQAPVEPL 
WEKASV 



EF065-1 (SEQ ID NO:245) 

TAGCGAAAGA AAATAGGGAG GATTAAAATG TTTAAGAAAG CAACGAAATT ATTATCGACA 
ATGGTGATTG TCGCTGGAAC AGTTGTGGGA AATTTCAGTC CCACATTGGC TTTAGCTGAA 
GAAGCGGTTA AAGCAGGAGA TACAGAAGGA ATGACCAATA CGGTGAAAGT GAAAGACGAC 
AGTCTGGCTG ATTGTAAACG GATATTGGAA GG AC AAGCT A CTTTCCCAGT TCAAGCGGGT 
GAAACGGAAC CAGTCGATTT AGTAGTTGTT GAAGATGCTA GTGGTAGTTT TTCAGATAAT 
TTTCCACATG TAAGACAAGC GATTGATGAA GTGGTTCAAG GCTTATCTGA TCAAGACCGC 
GTGATGCTGG CTTCATATCG CGGCGGAAAA CAATTTATGT TTCCTGATGG AAAGACAAAA 
ATTAATTCAG CTGATTATGA TATGAATGTG CGCGTCAATA CGCAATTGAC TTATGATAAA 
AGCCAATTTG TCTCTGGTTT TGGAGACGTT CGGACGTAT3 GTGGTACGCC AACCGCCCCA 
GGATTGAAAC TCGCTTTAGA TACGTACAAT CAAACACACo GAGATTTAAC GAATCGAAAA 
ACGTATTTCC TATTAGTGAC AGATGGGGTC GCTAATACAC GTTTAGATGG TTACTTGCAT 
AAGACCAATA CCAATGATTC AATCAATGAA TATCCAGATC CAAGACATCC TCTTCAAGTC 
TCAGTGGAAT ATAGTAATGA CTACCAAGGT GCAGCAGCAG AAGTTTTAGC GTTAAACCAA 
GAAATTACTA ACCAAGGCTA TGAAATGATT AATGCGTATT GGGAAAGTGT TGAATCTTTA 
AGTTCAGTGA ATTCATACTT TGATAAATAT AAAACAGAAG TGGGTCCTTT TGTAAAACAA 
GAGTTGCAAC AAGGGTCTAG CACACCAGAA GATTTTATTA CAAGCCAATC TATTGATGAT 
TTTACAACCC AATTAAAACA AATTGTCAAA GATCGTCTGG CGCAATCGAC ACCAGCAACA 
GCTTCATTAA CGATTGCCAA TCAATTTGAT ATTCAATCTG CGACCGCTAC GGACGATGCT 
GGAAATGATG TGCCTGTTCA AATTAACGGA CAAACCATTT CAGCAACTAG TACAGAAGGT 
TACGTAGGAA ACATCACGAT TCACTACGAA GTCAAAGAAA ATACAGCGAT TGATGCAGCA 
ACCCTTGTAA GTAGTGGGAC AATGAATCAA GGAACAATTG CTAAGGAATT TCCAGAAGCG 
ACGATTCCTA AAAATG/\CAA TGCGCATGCG TGTGACGTGA CGCCAGAAGA TCCAACGATT 
ACAAAAGATA TCGAAAATCA AGAACACTTA GATTTAACCA ATCGTGAAGA TAGTTTCGAT 
TGGCATGTCA AAACAGCCTT TGGCAACGAA ACCAGTACTT GGACCCAAGC CAGCATGGTG 
GATGACATTA ATAAAGTGCT AGATATCATT GATGTGAAAG TCACCGACGA AAATGGTAAA 
GATGTTACAG CTAACGGCAC AGTAACACAA GAAAATAACA AAGTAACTTT TGAAATGAAC 
AAACAAGCAG ACAGCTATGA CTATTTAAGT GGTCATACGT ATACAATGAC TATCACCACT 
AAAATTAAAA CTGACGCAAC GGACGAAGAA TTAGCGCCTT ACATTGAACA AGGCGGGATT 
CCCAACCAAG CCGACTTAAA CTTTGGCAAT GAAGGTGACG TGTTACATTC CAACAAACCA 
ACCGTAACAC CACCGCCAGT TGATCCAAAT ATTGCTAAAG ACGTAGAAGG ACAAGAACAT 
TTAGATTTAA CCAACCGCGA TCAAGAATTT AAATGGAACG TCAAAACAGC TTTCGGTAAC 
GAAACAAGCA CTTGGACCCA AGCCAGCATG GTAGATGACA TTAATAAAGT GTTAGACATC 
ACTGATGTAA AAGTCACAGA TGAAAATGGT AAAGATGTTA CAGCTAACGG CAAAGTAACA 
CAAGAAAATA ACAAAGTAAC TTTTGAAATG AACAANCAAG CNGACAGCTA TGACTATTTA 
AGTGGTCATA CGTACACAAT GACCATTACT ACTAAAATCA AAGCTAGCGC AACGGACGAA 
GAATTAGCAC CTTATATTGA ACAAGGTGGC ATTCCCAACC AAGCCGACTT GAACTTTGGC 
AACGAAGGTG ACGTGTTGCA TTCCAACAAA CCAACCGTAA CACCACCTGC ACCAACGCCA 
GAAGATCCAA CGATTACAAA AGATATCGAA GGCCAAGAAC ATTTAGATTT AACCAACCGT 
GACCAAGAAT TTAAATGGAA CGTCAAAACA GCTTTCGGTA ACGAAACAAG CACATGGACC 
CAAGCCAGCA TGGTGGATGA CATTAATAAA GTGTTAGACA TCACAGACGT GAAAGTTNCT 
GANGAAAATG GCAAAGATGT TACAGATAAT GGCATAGTAA CACAAGAAAA TAACAAAGTA 
ACTTTTACTA TGAACAAAAA AGATGACAGC TACTCTTACT TAGCTGGTCA TACATACACA 
ATGACTATTA CCACTAAAAT TAAAACTGAC GCAACGGATG AAGAATTAGC GCCTTATATT 
GAACAAGGCG GGATTCCCAA CCAAGCCGAC TTAAACTTTG GCAACGAAGG TGACGTGTTG 
CATTCCAACA AGCCAACCGT AACACCGCCT GCACCAACGC CAGAAGACCC AAAAAAACCT 
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G AA C C T AAA C AACCGCTAAA A:GGAAAAAA CCGTTGAC" CTACAAATCA TCAAGCACCA 

^„^^rnr-.~~ ATTTACC AATG AC TAAT 

ACAACAGTAA ATCCACTTTA CATGATCGCA GGTTCAATC:; CCCTTATAGT GG CTATTAGC 

TTT3GCATAA CAAAAAATAA AAAAAGAAAA. AATTA.G 



ZF065-2 (SEQ 2D NC:246) 

KF KKATKLLSTM VIVAG7VVGN FSPTLALAEE AVKAGET EG M TNTVKVKDDS 
LACCKF IIjEG QATFPVQAGE TE?\TLTO'E capgcfsc:;?' fhvrqailev vqglsdqdrv 
MLASYRGGKQ FMFPDGKTK I KSADYEKNYR VNTQLTYEKC CFVSGFGlVR TYGGTPTAPG 
LKLALDTYNQ THGDLTNRKT YFLLVTDGVA NTKLDGYLHK TMTNDSINEY PEPRHPLQVS 
VZYSNDYQGA AAEVLALNQE ITN^GYEMIN AYWESVES1F SYTISYFDFYK TFVGPFVKQE 
LQCCSSTPED FITSQSIDDF TTQEFQIVKD RLAQSTPATA SLTIANQFDI QSATATDDAG 
XEVPVQINGQ TISATSTEGY VGNITI HYEV KENTAIEAAT LV;:SGTMi:QG TIAKEFPEAT 
1PKNDNAHAC DVTPEDPTT T KDIENC EHLD LTNREDSFDV; HVTTAFGKET STWTQASMVD 
DirrKVLDIID VKVTDEHGKD VTANGTVTQE ^NXYTFEK:;?; QAESYDYLSG HTYTMTITTK 

iktdatdeel apyieqggip ncadlnfgne gdvlhs^k^t vtpppvdpni AKDVEGQEHL 

DLTtJRDQEFK WIIVKTAFGNE TSTWTQASMV DDINKVLEIT DYr'.VTDEUGK D\ r TANGKVTQ 
ENIIKVTFEMN XQADSYDYLS GHTYTMTITT KIXASATEZE LAPYIEQGGI PNQADLNFGN 
EGDVLK SNKP TVTPPAPTPE DPTIT?;DIEG OEHLDLTtCEr: OHFKWNVKTA FGNETSTWTQ 
ASKVDDINKV LDITDVKVXX EXGKDVTDWG IVTQEKMKVT F7MNKKDDSY SYLAGHTYTM 
TITTK1KTDA TDEELAPYIE QGGI PNQADL UFGNEGDVLH SNXPTVTPPA PTPEDPKKPE 
PKQPLKPKKP LTPTNKQAPT NPVNFGKSAS KGIHLPMTCT TYNPLYMIAG LIVLIVAISF 
GITKNKKRKN 



EF065-3 (SEQ ID NO:247) 

GGTTA AAGCAGGAGA TACAGAAGGA ATOAC ""AATA CGGTGAAAGT GAAAGACGAC 
AGTCTGGCTG ATTGTAAACG GATATTGGAA GGACAAGCTA CTTTCCCAGT TCAAGCGGGT 
GAAACGGAAC CAGTCGATTT AGTAGTTGTT GAAGATGCTA GTGGTAGTTT TTCAGATAAT 
TTTCCACATG TAAGACAAGC GATTGATGAA GTGGTTCAAG GCTTATCTGA TCAAGACCGC 
GTGATGCTGG CTTCATATCG CGGCGGAAAA CAATTTAT IT TTCCTGATGG AAAGACAAAA 
ATTAATTCAG CTGATTATGA TATGAATGTG CGCGTCAATA CGCAATTGAC TTATGATAAA 
AGCCAATTTG TCTCTGGTTT TGGAGACGTT CGGACGTATG GTGGTACGCC AACCGCCCCA 
GGATTGAAAC TCGCTTTAGA TACGTACAAT CAAACACACG GAGATTTAAC GAATCGAAAA 
ACGTATTTCC TATTAGTGAC AGATGGGGTC GCTAATAC AC GTTTAGATGG TTACTTGCAT 
AAGACCAATA CCAATGATTC AATCAATGAA TATCCAGATC CAAGACATCC TCTTCAAGTC 
TCAGTGGAAT ATAGTAATGA CTACCAAGGT GCAGCAGCA AA.GTTTTAGC GTTAAACCAA 
GAAATTACTA ACCAAGGCTA TGAAATGATT AATGCGTATT GGGAAAGTGT TGAATCTTTA 
AGTTCAGTGA ATTCATACTT TGATAAATAT AAAACAGAAG TGGGTCCTTT TGTAAAACAA 
GAGTTGCAAC AAGGGTCTAG CACACCAGAA GATTTTATTA CAAGCCAATC TATTGATGAT 
TTTACAACCC AATTAAAACA AATTGTCAAA GATCGTCTGG CGCAATCGAC ACCAGCAACA 
GCTTCATTAA CGATTGCCAA TCAATTTGAT ATTCAATC'TG CGACCGCTAC GGACGATGCT 
GGAAATGATG TGCCTGTTCA AATTAACGGA CAAACCATTT CAGCAACTAG TACAGAAGGT 
TACGTAGGAA ACATCACGAT TCACTACGAA G T C AAAG AAA ATACAGCGAT TGATGCAGCA 
ACCCTTGTAA GTAGTGGGAC AATGAATCAA GGAACAATTG CTAAGGAATT TCCAGAAGCG 
ACGATTCCTA AAAATGACAA TGCGCATGCG TGTGACGTGA CCCCAGAAGA TCCAACGATT 
ACAAAAGATA TCGAAAATCA AGAACACTTA GATTTAACCA ATCGTGAAGA TAGTTTCGAT 
TGGCATGTCA AAACAGCCTT TGGCAACGAA ACCAGTACTT G1ACCCAAGC CAGCATGGTG 
GATGACATTA ATAAAGTGCT AGATATCATT GATGTGAAAG TIACCGACGA AAATGGTAAA 
GATGTTACAG CTAACGGCAC AGTAACACAA GAAAATAACA AAGTAACTTT TGAAATGAAC 
AAACAAGCAG ACAGCTATGA CTATTTAAGT GGTCATACGT ATACAATGAC TATCACCACT 
AAAATTAAAA CTGACGCAAC GGACGAAGAA TTAGCGC 2TT ACATTGAACA AGGCGGGATT 
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CCCAACCAAG CCGACTTAAA CTTTGGCAAT 
ACCGTAACAC CACCGCCAGT TGATCCAAAT 
TTAGATTTAA CCAACCGCGA TCAAGAATTT 
GAAACAAGCA CTTGGACCCA AGCCAGCATG 
ACTGATGTAA AAGTCACAGA TGAAAATGGT 
CAAGAAAATA ACAAAGTAAC TTTTGAAATG 
AGTGGTCATA CGTACACAAT GACCATTACT 
GAATTAGCAC CTTATATTGA ACAAGGTGGC 
AACGAAGGTG ACGTGTTGCA TTCCAACAAA 
GAAGATCCAA CGATTACAAA AGATATCGAA 
GACCAAGAAT TTAAATGGAA CGTCAAAACA 
CAAGCCAGCA TGGTGGATGA CATTAATAAA 
GANGAAAATG GCAAAGATGT TACAGATAAT 
ACTTTTACTA TGAACAAAAA AGATGACAGC 
ATG AC TATTA CCACTAAAAT TAAAACTGAC 
GAACAAGGCG GGATTCCCAA CCAAGCCGAC 
CATTCCAACA AGCCAACCGT AACACCGCCT 
GAACCTAAAC AACCGCTAAA ACCGAAAAAA 
ACGAACCCAG TCAATTTTGG AAAATCAGCA 



gaaggtga:g TGTTACATTC CAACAAACCA 
ATTGCTAAAG ACGTAGAAGG ACAAGAACAT 
AAATGGAACG TCAAAACAGC TTTCGGTAAC 
GTAGATGACA TTAATAAAGT GTTAGACATC 
AAAGATGTTA CAGCTAACGG CAAAGTAACA 
AACAANCAAG CNGACAGCTA TGACTATTTA 
actaaaat:a AAGCTAGCGC AACGGACGAA 
attcccaag: aagccgactt gaactttggc 
ccaaccgtaa caccacctgc accaacgcca 
ggccaagaa: atttagattt aaccaaccgt 
gctttcggta acgaaacaag cacatggacc 
gtgttagaca tcacagacgt gaaagttnct 
ggcatagt/va cacaagaaaa taacaaagta 
tactctta":t tagctggtca tacatacaca 
gcaacggatg aagaattagc gccttatatt 
ttaaactttg gcaacgaagg tgacgtgttg 
gcaccaacg: cagaagaccc aaaaaaacct 
ccgttgacg: ctacaaatca tcaagcacca 
agtaaaggaa tt 



EF065-4 {SEQ ID NO:248) 



AVKAGDTEGM TNTVKVKDDS 
LADCKRILEG QATFPVQAGE 
MLASYRGGKQ FMFPDGKTKI 
LKLALDTYNQ THGDLTNRKT 
VEYSNDYQGA AAEVLALNQE 
LQQGSSTPED FITSQSIDDF 
NDVPVQINGQ TISATSTEGY 
IPKNDNAHAC DVTPEDPTIT 
DINKVLDIID VKVTDENGKD 
IKTDATDEEL APYIEQGGIP 
DLTNRDQEFK WNVKTAFGNE 
ENNKVTFEMN xqadsydyls 

EGDVLHSNKP TVTPPAPTPE 
ASMVDDINKV LDITDVKVXX 
TITTKIKTDA TDEELAPYIE 
PKQPLKPKKP LTPTNHQAPT 



TEPVDLVWE DASGSFSDHF 
NSADYDMNVR VNTQLTYDKr 
YFLLVTDGVA NTRLDGYLHX. 
ITNQGYEMIN AYVJESVESL^ 
TTQLKQIVKD RLAQSTPATA 
VGNITIHYEV KENTAIDAAT 
KDIENQEHLD LTNREDSFDVJ 
VTANGTVTQE NNKVTFEMI IX 
NQADLNFGNE GDVLHSNK PT 
TSTWTQASMV DDINKVLDIT 
GHTYTMTITT KIKASATDEE 
DPTITKDIEG QEHLDLTNRD 
ENGKDVTDNG IVTQENNKVT 
QGG I PNQADL NFGNEGDVLH 
NPVNFGKSAS KGIH 



PHVRQAIDEV VQGLSDQDRV 
QFVSGFGDVR TYGGTPTAPG 
THTNDSINEY PDPRHPLQVS 
SVNSYFDKYK TEVGPFVKQE 
SLTIANQFDI QSATATDDAG 
LVSSGTMNQG TIAKEFPEAT 
HVKTAFGNET STWTQASMVD 
QADSYDYLSG HTYTMTITTK 
VTPPPVDPNI AKDVEGQEHL 
DVKVTDENGK DVTANGKVTQ 
LAPYIEQGGI PNQADLNFGN 
QEFKWNVKTA FGNETSTWTQ 
F7MNKKDDSY SYLAGHTYTM 
SIJKPTVTPPA PTPEDPKKPE 



EF066-1 ( SEQ ID NO:249) 



TAGCGAAAGA AAATAGGGAG 
ATGGTGATTG TCGCTGGAAC 
GAAGCGGTTA AAGCAGGAGA 
AGTCTGGCTG ATTGTAAACG 
GAAACGGAAC CAGTCGATTT 
TTTCCACATG TAAGACAAGC 
GTGATGCTGG CTTCATATCG 
ATTAATTCAG CTGATTATGA 
AGCCAATTTG TCTCTGGTTT 
GGATTGAAAC TCGCTTTAGA 
ACGTATTTCC TATTAGTGAC 
AAGACCAATA CCAATGATTC 
TCAGTGGAAT ATAGTAATGA 



GATTAAAATG TTTAAGAAAG 
AGTTGTGGGA AATTTCAGT : 
TACAGAAGGA ATGACCAATA 
GATATTGGAA GGACAAGCTA 
AGTAGTTGTT GAAGATGCTA 
GATTGATGAA GTGGTTCAA 3 
CGGCGGAAAA CAATTTATGT 
TATGAATGTG CGCGTCAATA 
TGGAGACGTT CGGACGTATG 
TACGTACAAT CAAACACACG 
AGATGGGGTC GCTAATACA " 
AATCAATGAA TATCCAGATC 
CTACCAAGGT GCAGCAGCAG 



CAACGAAATT ATTATCGACA 
CCACATTGGC TTTAGCTGAA 
CGGTGAAAGT GAAAGACGAC 
CTTTCCCAGT TCAAGCGGGT 
GTGGTAGTTT TTCAGATAAT 
GCTTATCTGA TCAAGACCGC 
TTCCTGATGG AAAGACAAAA 
CGCAATTGAC TTATGATAAA 
GTGGTACGCC AACCGCCCCA 
GAGATTTAAC GAATCGAAAA 
GTTTAGATGG TTACTTGCAT 
CAAGACATCC TCTTCAAGTC 
AAGTTTTAGC GTTAAACCAA 
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r:AAV- AC- A-CAAG3CTA T3AAA7GA7T AA7G3G7ATT GGGAAAGT3T 7GAATC7TTA 
A377CAG7GA AT7CATA=7T 7GA7AAA7AT AAAA. Z A G AA G 7GGGTCCTT7 737AAAACAA 
GAGTTGCAAC AAGGGTCTAG CACACCAGAA GA7777ATTA CAAGCCAATC TATTGATGAT 
TT7ACAACCC AATTAAAACA AA77GTCAAA GATCGTZTGG CGCAATCGA: AGCAGCAACA 
GC^CATTAA C3ATTGCCAA 7CAA7TT^AT AT7GAA7CTG CGACCGCTA: G ^ACGATGCT 
GGAAATGATG 73CCTG77CA AA77AAC jGA :AAA:CA777 CAGCAACTAG TACAGAAGGT 
TACGTAGGAA ACATCACGAT TCACTAC 3AA GTCAAAGAAA ATACAGCGAT T3ATGCAGCA 
ACCCTTGTAA GTAGTGGGAC AATGAATCAA GGAA:AA7TG CTAAGGAATT TCCAGAAGCG 
ACGATTCCTA AAAATGACAA TG:GCATGCG TGT3ACGTGA :GCCAGAAGA TCCAACGATT 
ACAAAAGATA TCGAAAATCA AGAACACTTA 3ATTTAACCA ATCGTGAAGA TAGTTTCGAT 
TGG:ATGTCA AAACAGCCTT TGGCAACGAA ACCA3TACTT GGACCCAAGC CAGCATGGTG 
GAT3ACATTA ATAAAGTGCT AGA7A7CATT GA7GTGAAAG TCACCGACGA AAATGGTAAA 
GAT3 m TACAG CTAACGGCAC AGTAACACAA GAAAATAACA AAGTAACTTT TGAAATGAAC 
AAACAAGCAG ACAGCTATGA CTA7TTAAG7 GGTCATA:GT ATACAATGAC TATCACCACT 
AAAA TT AAAA CTGACGCAAC GGACGAAGAA TTAGCGCCTT ACATTGAACA AGGCGGGATT 
CCCAACCAAG CCGAC7TAAA CTTTGGCAAT GAAGGTGACG TGTTACATTC CAACAAACCA 
ACCGTAACAC CACCGCCAG7 TGATCCAAAT ATTGCTAAAG ACGTAGAAGG ACAAGAACAT 
TTAGATTTAA CCAACCGCGA TCAAGAATT7 AAAT3GAACG TCAAAACAGO TTTCGGTAAC 
GAAACAAGCA CTTGGACCCA AGCCAGCA7G GTA3ATGACA TTAATAAA3T GTTAGACATC 
AC TG ATGTAA AAGTCACAGA 7GAAAATGGT AAAGA7GTTA CAGCTAAC3G CAAAGTAACA 
CAAGAAAATA ACAAAGTAAC 7TTTGAAATG AACAANCAAG CNGACAGC7A TGACTATTTA 
AGTGGTCATA CGTACACAAT GACCATTACT ACTAAAATCA AAGCTAGCGC AACGGACGAA 
GAAT7AGCAC CTTATATTGA ACAAGGTGGC AT7CCCAACC AAGCCGAC77 GAACTTTGGC 
AACGAAGGTG ACGTGTTGCA TTCCAACAAA CCAACCGTAA CACCACCTGC ACCAACGCCA 
GAAGATCCAA CGATTACAAA AGATATCGAA GGCCAAGAAC ATTTAGAT77 AACCAACCGT 
GACCAAGAAT TTAAATGGAA CGTCAAAACA GCT77CGGTA ACGAAACAAG CACATGGACC 
CAAGCCAGCA TGGTGGATGA C ATT AA T AAA GTG7TAGACA TCACAGACGT GAAAGTTNCT 
GANGAAAATG GCAAAGATGT TACAGATAAT GGCATAGTAA CACAAGAAAA TAACAAAGTA 
ACTT7TACTA TGAACAAAAA AGATGACAGC TACTCTTACT TAGCTGGTCA TACATACACA 
ATGACTATTA CCACTAAAAT TAAAACTGAC GCAACGGATG AAGAATTAGC GCCTTATATT 
GAACAAGGCG GGATTCCCAA CCAAGCCGAC TTAAACTTTG GCAACGAAGG TGACGTGTTG 
CATTCCAACA AGCCAACCGT AACACCGCCT GCACCAACGC CAGAAGACCC AAAAAAACCT 
GAACCTAAAC AACCGCTAAA ACCGAAAAAA CCG7TGACGC CTACAAATCA TCAAGCACCA 
ACGAACCCAG TCAATTTTGG AAAATCAGCA AGTAAAGGAA TTCATTTACC AATGACTAAT 
ACAACAGTAA ATCCACTTTA CATGATCGCA GGTTTAATTG TCCTTATAGT GGCTATTAGC 
TTTGGCATAA CAAAAAATAA AAAAAGAAAA AATTAG 

EF066-2 (SEQ ID NO:250) 

MF KKATKLLSTM VIVAGTWGN FSPTLALAEE AVKAGDTEGM TNTVKVKDDS 
LADCKRILEG QATFPVQAGE TEPVDLWVE LASGSFSDMF PHVRQAI7EV VQGLSDQDRV 
MLASYRGGKf FMFPGGKTKI NSADYDMNVR VMTQLTYDXS QFVSGFG7VR TYGGTPTAPG 
LKLALDTYNQ THGDLTNRKT YFLLVTDGVA NTRLDGYLHK TNTNDSKIEY PDPRHPLQVS 
VEYSNDYQGA AAEVLALNQE ITNQGYEMIII AYWESVESLS SVNSYFD?:YK TEVGPFVKQE 
LQQGSSTPED FITSQSIDDF TTQLKQIVKD F.LAQSTPATA SLTIANQFDI QSATATDDAG 
NDVPVQINGQ TISATSTEGY VGNITIHYEV KEIITAIDAAT LVSSGTMIJQG TIAKEFPEAT 
I PKNDNAKAC DVTPEDPTIT KDIENQEHLD LTNREDSFDW HVKTAFGI IET STWTQASMVD 
DINKVLDIID VKVTDENGKD VTANGTVTQE IttTKVTFEMNK QADSYDYLSG HTYTMTITTK 
IKTDATDEEL APYIEQGGIP NQADLNFGNE GDVLHSNKPT VTPPPVDPNI AKDVEGQEHL 
DLTNRDQEFK WNVKTAFGNE TSTWTQASMV DDINKVLDIT DVKVTDEHGK DVTANGKVTQ 
ENNKVTFEMN XQADSYDYLS GHTYTMTITT KIKASATDEE LAPYIEQGGI PNQADLNFGN 
EGDVLHSNKP TVTPPAPTPE DPTITKDIEG QEHLDLTNRD QEFKWNVKTA FGNETSTWTQ 
ASMVDDINKV LDITDVKVXX ENGKDVTDNG IVTQEIJNKVT FTMNKKDDSY SYLAGKTYTM 
TITTKIKTDA TDEELAPYIE QGGIPNQADL NFGNEGDVLH SNKPTVTPPA PTPEDPKKPE 
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PKQPLKPKKP LTPTNHQAPT NPVNFGKSAS KGIHLPMTNT TVNPLYMI AG LIVLIVAISF 
GITKNKKRKN 

EF066-3 (SEQ ID NO:251) 

GGTTA AAGCAGGAGA TACAGAAGGA ATGACCAATA CGGTGAAAGT GAAAGACGAC 
AGTCTGGCTG ATTGTAAACG GATATTGGAA GGACAAGCTA CTTTCCCAGT TCAAGCGGGT 
GAAACGGAAC CAGTCGATTT AGTAGTTGTT GAAGATGCTA GTGGTAGTTT TTCAGATAAT 
TTTCCACATG TAAGACAAGC GATTGATGAA GTGGTTCAAG GCTTATCTGA TCAAGACCGC 
GTGATGCTGG CTTCATATCG CGGCGGAAAA CAATTTATGT TTCCTGATGG AAAGACAAAA 
ATTAATTCAG CTGATTATGA TATGAATGTG CGCGTCAATA CGCAATTGAC TTATGATAAA 
AGCCAATTTG TCTCTGGTTT TGGAGACGTT CGGACGTATG GTGGTACGCC AACCGCCCCA 
GGATTGAAAC TCGCTTTAGA TACGTACAAT CAAACACACG GAGATTTAAC GAATCGAAAA 
ACGTATTTCC TATTAGTGAC AGATGGGGTC GCTAATACAC GTTTAGATGG TTACTTGCAT 
AAGACCAATA CCAATGATTC AATCAATGAA TATCCAGATC CAAGACATCC TCTTCAAGTC 
TCAGTGGAAT ATAGTAATGA CTACCAAGGT GCAGCAGCAG AAGTTTTAGC GTTAAACCAA 
GAAATTACTA ACCAAGGCTA TGAAATGATT AATGCGTATT GGGAAAGTGT TGAATCTTTA 
AGTTCAGTGA ATTCATACTT TGATAAATAT AAAACAGAAG TGGGTCCTTT TGTAAAACAA 
GAGTTGCAAC AAGGGTCTAG CACACCAGAA GATTTTATTA CAAGCCAATC TATTGATGAT 
TTTACAACCC AATTAAAACA AATTGTCAAA GATCGTCTGG CGCAATCGAC ACCAGCAACA 
GCTTCATTAA CGATTGCCAA T C AATTTG AT ATTCAATCTG CGACCGCTAC GGACGATGCT 
GGAAATGATG TGCCTGTTCA AATTAACGGA CAAACCATTT CAGCAACTAG TACAGAAGGT 
TACGTAGGAA ACATCACGAT TCACTACGAA GTCAAAGAAA ATACAGCGAT TGATGCAGCA 
ACCCTTGTAA GTAGTGGGAC AATGAATCAA GGAACAATTG CTAAGGAATT TCCAGAAGCG 
ACGATTCCTA AAAATGACAA TGCGCATGCG TGTGACGTGA CGCCAGAAGA TCCAACGATT 
ACAAAAGATA TCGAAAATCA AGAACACTTA GATTTAAC C A ATCGTGAAGA TAGTTTCGAT 
TGGCATGTCA AAACAGCCTT TGGCAACGAA ACCAGTACTT GGACCCAAGC CAGCATGGTG 
GATGACATTA ATAAAGTGCT AGATATCATT GATGTGAAAG TCA 

EF066-4 (SEQ ID. NO : 252 ) 

AVKAGDTEGM TNTVKVKDDS 

LADCKRILEG QATFPVQAGE TEPVDLVWE DASGSFSDNF PHVRQAIDEV VQGLSDQDRV 
MLASYRGGKQ FMFPDGKTKI NSADYDMNVR VNTQLTYDKS QFVSGFGDVR TYGGTPTAPG 
LKLALDTYNQ THGDLTNRKT YFLLVTDGVA NTRLDGYLHK TNTNDSINEY PDPRHPLQVS 
VEYSNDYQGA AAEVLALNQE ITNQGYEMIN AYWESVESLS SVNSYFDKYK TEVGPFVKQE 
LQQGSSTPED FITSQSIDDF TTQLKQIVKD RLAQSTPATA SLTIANQFDI QSATATDDAG 
NDVPVQINGQ TISATSTEGY VGNITIHYEV KENTA I DAAT LVSSGTMHQG TIAKEFPEAT 
I PKNDNAHAC DVTPEDPTIT KDIENQEHLD LTNREDSFDW HVKTAFGNET STWTQASMVD 
DINKVLDI ID VKVT 

EF067-1 ( SEQ ID NO:253) 

TAGCGAAAGA AAATAGGGAG GATTAAAATG TTTAAGAAAG CAACGAAATT ATTATCGACA 
ATGGTGATTG TCGCTGGAAC AGTTGTGGGA AATTTCAGTC CCACATTGGC TTTAGCTGAA 
GAAGCGGTTA AAGCAGGAGA TACAGAAGGA ATGACCAATA CGGTGAAAGT GAAAGACGAC 
AGTCTGGCTG ATTGTAAACG GATATTGGAA GGACAAGCTA CTTTCCCAGT TCAAGCGGGT 
GAAACGGAAC CAGTCGATTT AGTAGTTGTT GAAGATGCTA GTGGTAGTTT TTCAGATAAT 
TTTCCACATG TAAGACAAGC GATTGATGAA GTGGTTCAAG GCTTATCTGA TCAAGACCGC 
GTGATGCTGG CTTCATATCG CGGCGGAAAA CAATTTATGT TTCCTGATGG AAAGACAAAA 
ATTAATTCAG CTGATTATGA TATGAATGTG CGCGTCAATA CGCAATTGAC TTATGATAAA 
AGCCAATTTG TCTCTGGTTT TGGAGACGTT CGGACGTATG GTGGTACGCC AACCGCCCCA 
GGATTGAAAC TCGCTTTAGA TACGTACAAT CAAACACACG GAGATTTAAC GAATCGAAAA 
ACGTATTTCC TATTAGTGAC AGATGGGGTC GCTAATACAC GTTTAGATGG TTACTTGCAT 



WO 98/50554 



PCT/LS98/08959 



151 

TABLE \ . Nucleotide and Ammo Acid Seqeuences of E. faecalis Genes. 

AAGACCAA7A CCAATGATTC AA7CAAT^AA 7A7CCA'"4ATC CAAGACA7CC "TTC^GTC 
7CAGTGGAA7 A7A37AA7GA C7ACCAA337 GCAGrA.CAG AACTTTTAGC ^^^A 
GAAATTACTA ACCAAGGC7A TGAAATGA77 AA7GCG7A7T G.^... .^TCTTTA 
A3TTC AGTGA ATTCATACTT TGA7AAATA-" ^A.^.-,^ 

GAGTTGCAAG AAGGGTCTA3 CACACCAGAA GATTTTATTA CAAGCCAAT: -ATTGATGAT 

— r^r^rrr .attaaaaga aattgtcaaa gatcgt :tgg cg:aatcga: accagcaaca 

C-CT-CATTAA CGATT3CCAA TCAATTTGAT ATTCAATCTG CGACCGCTAC GGACGATGCT 
GGAAATGATG TGCCT3TTCA AATTAACGGA CAAACCATTT CA3CAAC7AG TACAGAAGGT 
TArGTAGGAA ACATCACGAT TCACTACGAA GTCAAAGAAA ATA_AGCX-«T T^ATGCAOLA 
ACrC~X3TAA GTAGTGGGAC AATGAATCAA GGAACAATTG CTAAGGAATT TCCAGAAGCG 
ACGA-TCCTA AAAATGACAA TGCGCATGCG TCTGAC3TGA CGCCAGAAGA TCCAACGATT 
ACAAAAGATA TCGAAAATCA AGAACACTTA GATTTAACCA ATCGTGAAGA TAG TTTC GAT 
TGGCATGTCA AAACAGCCTT TGGCAACGAA ACC AGTACTT GGACCCAA.GC CAGCATGGTG 
GATGACATTA ATAAAGTGCT AGATATCATT GATGTGAAAG TCACCGACGA AAATGGTAAA 
GATGTTACAG CTAACGGCAC AGTAACACAA GAAAA TAACA AAGTAACTTT TGAAATGAAC 
AAACAAGCAG ACAGCTATGA CTATTTAAGT GGTCATACGT ATACAATGA: TATCACCACT 
AAAATTAAAA CTGACGCAAC GGACGAAGAA TTAGCCCCTT ACATTGAACA AGGCGGGATT 
CCCAACCAAG CCGACTTAAA CTTTGGCAAT GAAG3TGACG TGTTACAT'.'C CAACAAACCA 
ACCGTAACAC CACCGCCAGT TGATCCAAAT ATTG^TAAAG ACGTAGAAGG ACAAGAACAT 
TAGATTTAA CCAACCGCGA TC AAG AATTT AAATGGAACG TCAAAACAGC TTTCGGTAAC 
GAAACAAGCA CTTGGACCCA AGCCAGCATG G7AGATGACA TTAATAAA 3T GTTAGACATC 
ACTGATGTAA AAGTCACAGA TGAAAATGGT AAAGATGTTA CAGCTAAC'.G CAAAGTAACA 
CAAGAAAATA ACAAAGTAAC TTTTGAAATG AACAAtlCAAG CNGACAGCTA TGACTATTTA 
AGTGGTCATA CGTACACAAT GACCATTACT ACTAAAATCA AAGCTAGCGC AACGGACGAA 
GAATTAGCAC CTTATATTGA ACAAGGTGGC ATTCCCAACC AAGCCGACTT GAAC TTTGGC 
AACGAAGGTG ACGTGTTGCA TTCCAACAAA CCAACCGTAA CACCACCTGC ACCAACGCCA 
GAAGATCCAA CGATTACAAA AGATATCGAA GGCCAAGAAC ATTTAGA'i "IT AACCAACCGT 
GACCAAGAAT TTAAATGGAA CGTCAAAACA GCTTTCGGTA ACGAAACAAG CACATGGACC 
CAAGCCAGCA TGGTGGATGA CATTAATAAA GTGTTAGACA TCACAGAC JT GAAAGTTNC. 
GANGAAAATG GCAAAGATGT TACAGATAAT GG TATAGTAA C AC AAG AA. AA TAACAAAGTA 
ACTTTTACTA TGAACAAAAA AGATGACAGC TACTCTTACT TAGCTGG1CA TACATACACA 
ATGACTATTA CCACTAAAAT TAAAACTGAC GC AACGGATG AAGAATTAGC GCCTTATATT 
GAACAAGGCG GGATTCCCAA CCAAGCCGAC TTAAACTTTG GCAACGAAGG TGACGTGTTG 
CATTCCAACA AGCCAACCGT AACACCGCCT GCACCAACGC CAGAAGAC "C AAAAAAACCT 
GAACCTAAAC AACCGCTAAA ACCGAAAAAA CCGTTGACGC CTACAAAT^A TCAAGCACCA 
ACGAACCCAG TCAATTTTGG AAAATCAGCA AGTAAAGGAA TTCATTTA :C AATGACTAAT 
ACAACAGTAA ATCCACTTTA CATGATCGCA GGTTTAATTG TCCTTATAGT GGCTATTAGC 
TTTGGC ATAA CAAAAAATAA AAAAAGAAAA AATTAG 

EF067-2 (SEQ ID NO:254) 

MF KKATKLLSTM VIVAGTWGN FSPTLALAEE AVKAGDTEGK TNT/: VKDDS 
LADCKRILEG QATFPVQAGE TEPVCLVWE DASGSFSDNF PHVRQAIZEV VQGLSDQDRV 
MLASYRGGKQ FKFPDGKTKI NSADYDMNVF. VKTQLTYDKS QFVSGFGZVR TYGGTPTAPG 
LKLALDTYNQ THGDLTNRKT YFLLVTDGVA NTP.LDGYLHK TNTNDSIIIEY PDPRHPLQVS 
VEYSNDYQGA AAEVLALNQE ITNQGYEMIN AYV.'ESVESLS SVNSYFDV.YK TEVGPFVKQE 
LQQGSSTPEE FITSQSIDDF TTQLKQIVKD RLAGSTPATA SLTIANfFDI QSATATDDAG 
NDVPVQINGQ TISATSTEGY VGNITIHYEV KENTAIDAAT LVSSGTK::QG TIAKEFPEAT 
IPKNDNAHAC DVTPEDPTIT KDIEI1QEHLD LTUREDSFDW HVKTAFGIIET STWTQASMVD 
DINKVLDIID VKVTDENGKD VTANGTVTQE KUKVTFEMNK QADSYDYLSG KTYTMTITTK 
IKTDATDEEL APYIEQGGIP NQADLNFGNE GDVLHSNKPT VTPPPVEPNI AKDVEGQEHL 
DLTNRDQEFK WNVKTAFGNE TSTWTQASMV BDIKKVLDIT DVKVTDEIIGK DVTANGKVTQ 
ENNKVTFEMN XQADSYDYLS GHTYTMTITT KIKASATDEE LAPYIEC-GGI PNQADLNFGN 
EGDVLHSNKP TVTPPAPTPE DPTITKDIEG QEKLDLTNRD QEFKWNVKTA FGNETSTWTQ 
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TABLE 1. Nucleotide and Ammo Acid Seqeuences of E. faecalis Genes. 

ASMVDDINKV LDITDVKVXX ENGKDVTDNG IVTQENNKVT FTMNKKDDSY SYLAGHTYTM 
TITTKIKTDA TDEELAPYIE QGGIPNQADL NFGNEGDVLH SNKPTVTPPA PTPEDPKKPE 
PKQPLKPKKP LTPTNHQAPT NPVNFGKSAS KGIHLPMTNT TVNPLYMIAG LIVLIVAISF 
GITKNKKRKN 

EF067-3 ( SEQ ID NO:255) 

GCT AGATATCATT GATGTGAAAG TCACCGACGA AAATGGTAAA 

GATGTTACAG CTAACGGCAC AGTAACACAA GAAAATAACA AAGTAACTTT TGAAATGAAC 
AAACAAGCAG ACAGCTATGA CTATTTAAGT GGTCATACGT ATACAATGAC TATCACCACT 
AAAATTAAAA CTGACGCAAC GGACGAAGAA TTAGCGCCTT ACATTGAACA AGGCGGGATT 
CCCAACCAAG CCGACTTAAA CTTTGGCAAT GAAGGTGACG TGTTACATTC CAACAAACCA 
ACCGTAACAC CACCGCCAGT TGATCCAAAT ATTGCTAAAG ACGTAGAAGG ACAAGAACAT 
TTAGATTTAA CCAACCGCGA TCAAGAATTT AAATGGAACG TCAAAACAGC TTTCGGTAAC 
GAAACAAGCA CTTGGACCCA AGCCAGCATG GTAGATGACA TTAATAAAGT GTTAGACATC 
ACTGATGTAA AAGTCACAGA TGAAAATGGT AAAGATGTTA CAGCTAACGG CAAAGTAACA 
CAAGAAAATA ACAAAGTAAC TTTTGAAATG AACAANCAAG CNGACAGCTA TGACTATTTA 
AGTGGTCATA CGTACACAAT GACCATTACT A C T AAAATC A AAGCTAGCGC AACGGACGAA 
GAATTAGCAC CTTATATTGA ACAAGGTGGC ATTCCCAACC AAGCCGACTT GAACTTTGGC 
AACGAAGGTG ACGTGTTGCA TTCCAACAAA CCAACCGTAA CACCACCTGC ACCAACGCCA 
GAAGATCCAA CGATTACAAA AGATATCGAA GGCCAAGAAC ATTTAGATTT AACCAACCGT 
GACCAAGAAT TTAAATGGAA CGTCAAAACA GCTTTCGGTA ACGAAACAAG CACATGGACC 
CAAGCCAGCA TGGTGGATGA C ATT AAT AAA GTGTTAGACA TCACAGACGT GAAAGTTNCT 
GANGAAAATG GCAAAGATGT TACAGATAAT GGCATAGTAA CACAAGAAAA TAACAAAGTA 
ACTTTTACTA TGAACAAAAA AGATGACAGC TACTCTTACT TAGCTGGTCA TACATACACA 
ATGACTATTA CCACTAAAAT TAAAAC TG AC GCAACGGATG AAGAATTAGC GCCTTATATT 
GAACAAGGCG GGATTCCCAA CCAAGCCGAC TTAAACTTTG GCAACGAAGG TGACGTGTTG 
CATTCCAACA AGCCAACCGT AACACCGCCT GCACCAACGC CAGAAGACCC AAAAAAACCT 
GAACCTAAAC AACCGCTAAA ACCGAAAAAA CCGTTGACGC CTACAAATCA TCAAGCACCA 
ACGAACCCAG TCAATTTTGG AAAATCAGCA AGTAAAGGAA TT 

EF067-4 (SEQ ID NO:256) 

VLDIID VKVTDENGKD VTANGTVTQE NNKVTFEMNK QADSYDYLSG KTYTMTITTK 
IKTDATDEEL APYIEQGGIP NQADLNFGNE GDVLHSNKPT VTPPPVDPNI AKDVEGQEHL 
DLTNRDQEFK WNVKTAFGNE TSTWTQASMV DDINKVLDIT DVKVTDENGK DVTANGKVTQ 
ENNKVTFEMN XQADSYDYLS GHTYTMTITT KIKASATDEE LAPYIEQGGI PNQADLNFGN 
EGDVLHSNKP TVTPPAPTPE DPTITKDIEG QEHLDLTNRD QEFKWNVKTA FGNETSTWTQ 
ASMVDDINKV LDITDVKVXX ENGKDVTDNG IVTQENNKVT FTMNKKDDSY SYLAGHTYTM 
TITTKIKTDA TDEELAPYIE QGGIPNQADL NFGNEGDVLH SNKPTVTPPA PTPEDPKKPE 
PKQPLKPKKP LTPTNHQAPT NPVNFGKSAS KGIH 



EF068-1 (SEQ ID NO:257) 

TAGGGGAAGC TAATGATCTT GGTATTTATC 
ATGAAAAAGA AAATTGTTGA GGATTTTAAT 
CGCAAGATGC TTAATTTAGC AATATCAAGT 
GTAAGTATAG CTGTTACCTC TGGCACAATC 
CTATTATCAA ATGTTACGTC AAATAATGAC 
GCCGCAAACC AAAATC AACC AGTTAATTTC 
TCCGCTGTGT TT AG TGG AC A AAAACAAGCG 
AATGTAGCTG CAGCAGGCAG CGCAGCAATC 



GTTTATTTTA AAGAAAAGAG GGACGATCAG 
CGGAAAAGTC AGCATAAAAA ATGGACAAAA 
GGTTTATTAT TTACGTCATT AGCAATCCCT 
AGTGCATCAG CAGCGGTCTT GGATATCGAA 
AGTGGCACTT CAACGAGTAA TCGTTGGACA 
ACGGTTTCTG GTGGCGCTTT AGCAGATGCT 
GTGTTAGTGG TTCCTCCTGA GTTAAGAGGA 
AATACCAATG TCACGATTGA TCTTTCAAAA 
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TABLE ;. Nucleotide and Ammo Acid Seqcuenccu of E. facialis G.-ncs. 

GTTAC77777 7GAC7GCCG7 TTTGAATGCA G C C AATG ATT TAACCAA'I 37 GA77ACTCAA 
A7TACCAG7G GGG CGTTAGG GAAT77AAC7 GG73TTGATA TTGATTTGAC GGAAGTGAAT 
'"G7CAATT3G AAT7AGTTAA- TAAGA7TGAA AACTTAGGTG C73CTTCA7T TACAGCTCCG 
GAAACGTTAG CAGCTGACC3 C7CA7AC A77 AGTGCACCGA TTAGTGA . 3G TTTAGGGTTA 
GT7TTAGCCC AAAATGTTTC AAACATOTTA "AA3A777GA ATSCGGCA3T 7CAAGCTTTG 
GAGGCAAAAG GTACCAGTAT CCCAAG7AA7 C773TCGCCG CAGCTATAAA TGCAGCCTTG 
CTT-CTGTCA AAGGCACGGT AAACGTGGC7 GTT7CAGGTG CTTTGCCT7T ATTAGCGGTT 
GGTGGTTCAG GC3TAAATGA GTTAGTGGAT GCT7C7TTAC TAGGCACAAC CAGGGTTACT 
TTACCAACTA CCGTTTCAAC ACC7CAAAA7 TTATCCAATA ATTTAGATG :; TCGTTTTGTA 
GGAACAGTCG T7CAAACAGA TCTTTTAGAC GTTAATTTAT TAGCAACAG ?. AGACGGTGTA 
TCCAACATTT ATTTTGCTGC AGGCACTACT AGTGAAGTAA CCGCACCAA.: AATCACAGGA 
GTAACAGGTA ATTCAACAGC AGGTTACGAA GTTAAAGGAA CTGCCGAC3 3 CAATGCCACG 
GTTGAAATCC GAAATGCAGG AGGCACCGTA ATAGGCACAG GTACCGC73A TGGGACAGGA 
GCGTTTACAG TTACCGTTCC CGCAGGTGAA GCAGGCGCCA ATGAAACGTT AACCGCCGTA 
GCGAAAAACG CCAGCGGNAC AGAAAGKACG CCAACAACGT TCCAAACIJCC AGCGGATGAA 
GCAACCGTAA CCGCACCAAC AATCACAGGA G7GACAGGTA ATTCAA3GGC AGGTTACGAA 
GTTAAAGGAA CTGCCGATGC CAATGCCACG GTTGAAATCC GAAATGCAGG AGGCACCGTA 
ATAGGCACAG G7ACCGCTGA TGGGACAGGA GCGTTTACAG TTACCGTTCC CGCAGGTGAA 
GCAGGTGCCA ATGAAACGTT AACCGCCGTA GCGAAAAACG CCAGCGGCAC AGAAAGTACG 
CCAACAACGT TCCAAACACC AGCGGATGAA GCAACCGTAA CCGCACCAA! AATCACAGGA 
GTGACAGGTA ATTCAACAGC AGGTTACGAA GTTAAAGGAA CTGCCGATGC CAATGCCACG 
GTTGAGATCC GAAATGCAGG AGGTGCCGTG ATAGGTACAG GTACTGCTGA TGGGACAGGG 
GCATTTACAG TTACCATTCC CGCAGGTGAA GCAGGTGCGA ATGAAACGTT AACCGCCGTA 
GCGAAAAACG CCAGCGGTAC AGAAAGTACG CCAACAACGT TCCAAACGCC AGCGGATCCT 
AATACGCCCG TGGCGACGCC AATTGTTGAG ACTGTAACAG GTAGTAC/-A.C AAAAGGCTAT 
GAGGTCAAAG GGACTGCTGA AGTTGGCACC ACCATTGAGG TTCGCGA~GC AGCTGGCACG 
GTCCTTGGTA CTGCAACAAC TGGAACTGAC GGAAAATATA CAGTGAC77T AGATTCAGGA 
ACAGCAACAG CAAATCAAAC GCTGAGCGTT GTAGCGAAAA ACGCTAGTGG CACGGAAAGT 
n AAC C AG C AA CGGCGACAAC ACCAGCTGAT GTCACTGCAC CAACAGTTGA TAACATCACA 
GGCAACTCTG GTTCGGGTTA TG AAATTAC A GGAACAGCAG ACCCTAACAC AACAATCGAA 
GTTCGTGATC CATCTGGGGC AGTCATTGGT ACAGGTACCT CTGATGCGAA TGGTGATTTT 
ACTGTAACGC TACCAACGGG AACGACCAA7 CCTGGGGATA CGTTAACAGT GATTGGAAAG 
GATAACGCGG GAAATGAAAG TCAACCGACT GAAGTCCTTG TTCCTGCT "A TGCCACGGTT 
ACAGCACCAA CTGTAACAGG AGTAACAGGT AATTCAGTTG CTGGTTAT 2A GGTGACAGGC 
ACCGCTGATC CGAATGCTAC CATCGAAATT CGTGATGCAG ATGGGAA:3T GATTGCAACA 
GGGACTGCCG ATGGGACTGG TTCCTTTGCT GTGAACCTTC CAGCTGG3AC GGCAAATGCG 
AATGAAACAT TGACAGCGTT AGCCAAAGAT CCTGCTGGCA ATACAAG 7 AC ACCGACAACC 
TTCCAAACAC CAGCAGATGA AGTAGTGGCA CCGCCAAGTG TCGACAAA-3T TACTGGGAAT 
ACAACACAAG GATATCAAGT GACAGGTACC GCTGAACTTG GCACCACCAT TGAAGTTCGT 
GCAACAGACG GAACAGTTTT AGGCACCGCA ACAACTGGAC CGACTGGC 3A AT AT AC TGTG 
ACGTTAGCTT CAGGAAAAGC AACAGCTAAA CAAACAGTGA ATGTAGTJ 3C TAAAAATGAT 
ACTGGACTTG AGAGTCAACC AACTACAGCT ATGACACCCG CTGATGTTAC CACACCAACA 
ATTGGTGACA TTACTGGAGA TTCAACAACT GGTTATGAAA TCACTGGG AC GGCGGACCCT 
AATACCACCA TTGAAGTACG GAACCCAGAT GGAACAATTA TTGGTACAAC GACAACGGAT 
GATCAAGGAA ACTTTACTGT GGACCTTCCA GCGGGAGCCG CTAATCC73G TG AT AC ATT A 
ACAGTTGTTG GAAAAGACGG TGACGGCAAT GAAAGTCAAC CAACGGAAGT GACGGTCCCT 
GAAGATGCAA CCGTAGCAGC ACCAACTGTG ACGACTGTTA CAGGAACAAC TGCCACTGGG 
TATCAAGTAA CCGGCACGGC AGAGCCAAAT GTCACCATTG AGATTCAC AA TG AAGCAGGT 
TTAG TTATTG CTACGGGAAC GACTGATGGT GCTGGCGCAT TTACAATC AC TCTTCCGACG 
GGCACAGCAA CAGCTAACGA AGCCTTAACT GCCATTGCGA AAGATGC7GC TGGGAAAGAA 
AGTAATCCGA CTGCTTTCAA AACACCTGCT GATCCAGATG CACCAGTCGC GACACCTACT 
GTTGACAAAA TCACTGGTAG CACGACAAAC GGCTATCAAG TAGTAGGAGC AGCAGAAGTT 
GGTACAACAG TTGAGGTGCG TGACGCCGAT GGCACAGTCC TTGGCATC3C AACTACTGGA 
ACTGATGGCA AATACACAGT GACTTTAGAG CCAGGGAAGG CCTCAGCTAA CGAAACAATA 
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TABLE 1. Nucleotide and Amino Acid Seqeuences of E. faecalis Genes. 

GTTACTTTTT TGACTGCCGT TTTGAATGCA GCCAATGATT TAACCAATGT GATTACTCAA 

ATTACCAGTG GGGCGTTAGG GAATTTAACT GGTGTTGATA TTGATTTGAC GGAAGTGAAT 

CGTCAATTGG AATTAGTTAA TAACATTGAA AACTTAGGTG CTGCTTCATT TACAGCTCCG 

GAAACGTTAG CAGCTGACGG CTCATACATT AGTGCACCGA TTAGTGATGG TTTAGGGTTA 

GTTTTAGCCC AAAATGTTTC AAACATCTTA CAAGATTTGA ATGCGGCAGT TCAAGCTTTG 

GAGGCAAAAG GTACCAGTAT CCCAAGTAAT CTTGTCGCCG CAGCTATAAA TGCAGCCTTG 

CTTCCTGTCA AAGGCACGGT AAACGTGGCT GTTTCAGGTG CTTTGCCTTT ATTAGCGGTT 

GGTGGTTCAG GCGTAAATGA GTTAGTGGAT GCTTCTTTAC TAGGCACAAC CACGGTTACT 

TTACCAACTA CCGTTTCAAC ACCTCAAAAT TTATCCAATA ATTTAGATGC TCGTTTTGTA 
GGAACAGTCG TTCAAACAGA TCTTTTAGAC GTTAATTTAT TAGCAACAGC AGACGGTGTA 

TCCAACATTT ATTTTGCTGC AGGCACTACT AGTGAAGTAA CCGCACCAAC AATCACAGGA 

GTAACAGGTA ATTCAACAGC AGGTTACGAA G TT AAA GG AA CTGCCGATGC CAATGCCACG 

GTTGAAATCC GAAATGCAGG AGGCACCGTA ATAGGCACAG GTACCGCTGA TGGGACAGGA 

GCGTTTACAG TTACCGTTCC CGCAGGTGAA GCAGGCGCCA ATGAAACGTT AACCGCCGTA 

GCGAAAAACG CCAGCGGNAC AGAAAGNACG CCAACAACGT TCCAAACNCC AGCGGATGAA 

GCAACCGTAA CCGCACCAAC AATCACAGGA GTGACAGGTA ATTCAACGGC AGGTTACGAA 

GTTAAAGGAA CTGCCGATGC CAATGCCACG GTTGAAATCC GAAATGCAGG AGGCACCGTA 

ATAGGCACAG GTACCGCTGA TGGGACAGGA GCGTTTACAG TTACCGTTCC CGCAGGTGAA 

GCAGGTGCCA ATGAAACGTT AACCGCCGTA GCGAAAAACG CCAGCGGCAC AGAAAGTACG 

CCAACAACGT TCCAAACACC AGCGGATGAA GCAACCGTAA CCGCACCAAC AATCACAGGA 

GTGACAGGTA ATTCAACAGC AGGTTACGAA GTTAAAGGAA CTGCCGATGC CAATGCCACG 

GTTGAGATCC GAAATGCAGG AGGTGCCGTG ATAGGTACAG GTACTGCTGA TGGGACAGGG 

GCATTTACAG TTACCATTCC CGCAGGTGAA GCAGGTGCGA ATGAAACGTT AACCGCCGTA 

GCGAAAAACG CCAGCGGTAC AGAAAGTACG CCAACAACGT TCCAAACGCC AGCGGATCCT 

AATACGCCCG TGGCGACGCC AATTGTTGAG ACTGTAACAG GTAGTACAAC AAAAGG C TAT 

GAGGTCAAAG GGACTGCTGA AGTTGGCACC ACCATTGAGG TTCGCGATGC AGCTGGCACG 

GTCCTTGGTA CTGCAACAAC TGGAACTGAC G G AAAAT AT A CAGTGACTTT AGATTCAGGA 

ACAGCAACAG CAAATCAAAC GCTGAGCGTT GTAGCGAAAA ACGCTAGTGG CACGGAAAGT 

CAACCAGCAA CGGCGACAAC ACCAGCTGAT GTCACTGCAC CAACAGTTGA TAACATCACA 

GGCAACTCTG GTTCGGGTTA TGAAATTACA GGAACAGCAG ACCCTAACAC AACAATCGAA 

GTTCGTGATC CATCTGGGGC AGTCATTGGT ACAGGTACCT CTGATGCGAA TGGTGATTTT 

ACTGTAACGC TACCAACGGG AACGACCAAT CCTGGGGATA CGTTAACAGT GATTGGAAAG 

GATAACGCGG GAAATGAAAG TCAACCGACT GAAGTCCTTG TTCCTGCTGA TGCCACGGTT 

ACAGCACCAA CTGTAACAGG AGTAACAGGT AATTCAGTTG CTGGTTATCA GGTGACAGGC 

ACCGCTGATC CGAATGCTAC CATCGAAATT CGTGATGCAG ATGGGAACGT GATTGCAACA 

GGGACTGCCG ATGGGACTGG TTCCTTTGCT GTGAACCTTC CAGCTGGGAC GGCAAATGCG 

AATGAAACAT TGACAGCGTT AGCCAAAGAT CCTGCTGGCA ATACAAGTAC ACCGACAACC 

TTCCAAACAC CAGCAGATGA AGTAGTGGCA CCGCCAAGTG TCGACAAAGT TACTGGGAAT 

ACAACACAAG GATATCAAGT GACAGGTACC GCTGAACTTG GCACCACCAT TGAAGTTCGT 

GCAACAGACG GAACAGTTTT AGGCACCGCA ACAACTGGAC CGACTGGCCA ATATACTGTG 

ACGTTAGCTT CAGGAAAAGC AACAGCTAAA CAAACAGTGA ATGTAGTTGC TAAAAATGAT 

ACTGGACTTG AGAGTCAACC AACTACAGCT ATGACACCCG CTGATGTTAC CACACCAACA 

ATTGGTGACA TTACTGGAGA TTCAACAACT GGTTATGAAA TCACTGGGAC GGCGGACCCT 

AATACCACCA TTGAAGTACG GAACCCAGAT GGAACAATTA TTGGTACAAC GACAACGGAT 

GATCAAGGAA ACTTTACTGT GGACCTTCCA GCGGGAGCCG CTAATCCTGG TGATACATTA 

ACAGTTGTTG GAAAAGACGG TGACGGCAAT GAAAGTCAAC CAACGGAAGT GACGGTCCCT 

GAAGATGCAA CCGTAGCAGC ACCAACTGTG ACGACTGTTA CAGGAACAAC TGCCACTGGG 

TATCAAGTAA CCGGCACGGC AGAGCCAAAT GTCACCATTG AGATTCACAA TGAAGCAGGT 

TT AG TTATTG CTACGGGAAC GACTGATGGT GCTGGCGCAT TTACAATCAC TCTTCCGACG 

GGCACAGCAA CAGCTAACGA AGCCTTAACT GCCATTGCGA AAGATGCTGC TGGGAAAGAA 

AGTAATCCGA CTGCTTTCAA AACACCTGCT GATCCAGATG CACCAGTCGC GACACCTACT 

GTTGACAAAA TCACTGGTAG CACGACAAAC GGCTATCAAG TAGTAGGAGC AGCAGAAGTT 

GGTACAACAG TTGAGGTGCG TGACGCCGAT GGCACAGTCC TTGGCATGGC AACTACTGGA 

ACTGATGGCA AATACACAGT GACTTTAGAG CCAGGGAAGG CCTCAGCTAA CGAAACAATA 
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ACTGTCGTAG CGAAAAATGC AACAGGAAAA GAAAG7CAGC CAGCTACAGG AACTACACCA 
G TC G A C TT AG C T A C A 2 C AA 2 C A TT G ATT 2 7 ATT A. - C G G AA. A TTC TAG. AG-. A. G G TT A C G AA 
ATGACTGGAA CGGTGGAGCC AAAAACCACT ATTGATGTGG GTGACGCAGA CG3AACCATC 
A.TT 3 CTGCTA* CA-ATTGCTAA CGAAA.CCG3C CAA.TATACGG TGACTCTA7 2 A3CTGGCGTA 
GTGACACCAG GAGAAACGAT TACGATTATT AGCAAAGATG GCGCAGGTAA TGAAAGTCAA 
C "J A GCTACAG C2 3TTATTCC A 3 7GGA.T2TT GTTTTAG -GG CGTGAACTAT TACGAAGGTT 
GAAGGAAACA AA. 22 Z AA. T 3 G CTATACAGTC A7-GGAAJTG CTGATCGAAA. TG TCACGG'IT 
CAATTTTACA ATAGGAGTGA ATAATTATTG G 3AAGTG3CA ATACAACTAC TGGAGGTACC 
TT:TCGGTT: ATATTGCAGC A3GGTTAGCA ACAGAAAAA5 AAACGTTAA2 CGCACTAACC 
ATAGATAGA7 AAG3AAATGT GA3TCC7AAA AGCA GATTTA TGACGCGAG 2 C 5ATATTACG 
GGA3AACGAG AGATTAAAAT TG E GGG AC C A ACTGTTT-TT CAGTTTTAG j AACGTCTAAA 
GCCGGCTACC TCATCAAAGG AACAGCTGAA CCAAACCGAA TCATTCAAAT TAGTAACCGA 
CTATTAAGAA GTGT3ATTGC TGTA3GTGCC ACCGATGCT3 AAGGCAACTT CGCTATCCAA 
TTAACAGCG3 GACAAGCGA: TG:T:AACAA AGTTTACTT3 CGACAGCTA: CGATGGCGCA 
GGACATTACA GTACGGCTAC AACCTTCATG ACGCCAGCCG ACCCAACGAA TrCTGGAGGA 
GGCAATGGTA ACACTGGCGG AAATAACGGC AATATAGGC3 GCAATACA3 3 AAACAATGGC 
GCAACTGGC 2 GGAATAATGG GAATGGTTCA AACACAGGTT CAAATCCAAA TGGAGGTTCT 
GGTTTAGGCA CAACAGGTTG TGGCTTAGGT TCACTAGGCA ATGGCCTCG3 TACAAATGGT 
AGTGGCTAC 2 ACCCTAAACT AAGTACCATC AGTTATGGCA CTGGAAATCA CGGGAAAACA 
GGCTACTTAG CTAGCACAG3 TG AAAAA.GAG TCTTCAGCCG TGAC AAC AA 3 TTTGTTTGGC 
GCCTTTGTCG CACTCCTTGG GAGCATGGGA ATCATCAAAC GCAAACGTAA A AAC TAG 

EF068-2 (SEC ID NO:258) 

M KKKIVELFNR KSQHKKWTKR KKLNLAISSG LEFTS LAIPV 

SIAVTSGTIS ASAAVLDIEL LSNVTSNNDS GTSTSNPWTA ANQNQPVNFT VSGGALADAS 
AVFSGQKQAV LWPPELRGN VAAAGSAAIN TNVTIDLSKV TFLTAVLNAA NDLTNVITQI 
TSGALGNLTG VD1DLTEVNR C-LELVNNIEN LGAASFTAPE TLAADGSY IS APISDGLGLV 
LAC'NVSNILQ DLNAAVQALE AKGTS I PSNL VAAAINAALL PVKGTVNVAV SGALPLLAVG 
GSGVNELVDA SLLGTTTVTL PTTVSTPQNL SNNLDARFVG TWQTDLLDV NLLATADGVS 
NIYFAAGTTS EVTAPTITGV TGNSTAGYEV K G T A DAN ATV E I RNAGGTV I GTGTADGTGA 
FTVTVPAGEA GA1IETLTAVA KNASGTEXTP TTFQTPADEA TVTAPTITGV TGNSTAGYEV 
KG TAD AN ATV EI RNAGGTV I GTGTADGTGA FTVTVPAGEA GANETLTAVA KNASGTESTP 
TTFQTPADEA TVTAPTITGV TGKSTAGYEV KGTADANATV EIRNAGGAVI GTGTADGTGA 
FTVTI PAGEA GANETLTAVA KNASGTESTP TTFQTPADPN TPVATPIVET VTGSTTKGYE 
vkgtaevgtt IEVRDAAGTV LGTATTGTDG KYTVTLDSGT ATANQTLSW AKNASGTESQ 
PATATTPADV TAPTVDNITG NSGSGYEITG TADPNTTIEV RDPSGAVIGT GTSDANGDFT 
VTLPTGTTNP GDTLTVIGKD IJAGNESQPTE VLVPADATVT APTVTGVTGN SVAGYQVTGT 
ADPNATIEIR DADGNVIATG TADGTGSFAV NL PAG TAN AN ETLTALAKDP AGNTSTPTTF 
QTPADEWAP PSVDKVTGNT TQGYQVTGTA ELGTTIEVRA TDGTVLGTAT TG PTGQYTVT 
LASGKATAKQ TVNWAKNDT GLESQPTTAK TPADVTTPTI GDITGDSTTG YEITGTADPN 
TTIEVRNPDG TT IGTTTTDD QGNFTVDLPA GAANPGDTLT WGKDGDGNE SQPTEVTVPE 
DATVAAPTVT TVTGTTATGY QVTGTAEPNV TIEIHNEAGL VIATGTTDGA GAFTITLPTG 
TATANEALTA IAKDAAGKES NPTAFKTPAD PDAPVATPTV DKITGSTTNG YQWGAAEVG 
TTVEVRDADG TVLGMATTGT DGKYTVTLEP GKASANETIT WAKNATGEE SQPATATTPV 
DLATPTIDSI TGNSSKGYEI TGTAEPKTTI DVRDADGTI I AATTANETGQ YTVTLPAGW 
TPGETITIIS KDGAGNESQP ATAVIPADW LAAPTITKVE GNKANGYTVT GTADPNVTVQ 
FYNSSEQLLA SGNTTTGGTF SVKIAAGLAT EKETLTALTT DTQGNVSPKT TFMTPADITG 
EPEIKIAAPT VSSVLGTSKA GYLIKGTAEP NRIIQISNRL LRSVIAVGAT DAEGNFAIQL 
TAGQATAQQS LLATATDGAG HYSTATTFMT PADPTNPGGG NGNTGGNNGN TGGNTGNNGA 
TGGNNGNGSN TGSNPNGGSG LGTTGSGLGS LGNGLGTNGS GYHPKLSTIS YGTGNHGKTG 
YLPSTGEKES SAVTTSLFGA FVALLASMGI IKRKRKN 
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TABLE 1. Nucleotide and Amino Acid Seqeuences of E. faecalis Genes. 
EF068-3 (SEQ ID NO:259) 

CTC TGGCACAATC AGTGCATCAG CAGCGGTCTT GGATATCGAA 

CTATTATCAA ATGTTACGTC AAATAATGAC AGTGGCACTT CAACGAGTAA TCGTTGGACA 
GCCGCAAACC AAAATCAACC AGTTAATTTC ACGGTTTCTG GTGGCGCTTT AGCAGATGCT 
TCCGCTGTGT TTAGTGGACA AAAACAAGCG GTGTTAGTGG TTCCTCCTGA GTTAAGAGGA 
AATGTAGCTG CAGCAGGCAG CGCAGCAATC AATACCAATG TCACGATTGA TCTTTCAAAA 
GTTACTTTTT TGACTGCCGT TTTGAATGCA GCC AATGATT TAACCAATGT GATTACTCAA 
ATTACCAGTG GGGCGTTAGG GAATTTAACT GGTGTTGATA TTGATTTGAC GGAAGTGAAT 
CGTCAATTGG AATTAGTTAA TAACATTGAA AAC TTAGGTG CTGCTTCATT TACAGCTCCG 
GAAACGTTAG CAGCTGACGG CTCATACATT AGTGCACCGA TTAGTGATGG TTTAGGGTTA 
GTTTTAGCCC AAAATGTTTC AAACATCTTA CAAGATTTGA ATGCGGCAGT TCAAGCTTTG 
GAGGCAAAAG GTACCAGTAT CCCAAGTAAT CTTGTCGCCG CAGCTATAAA TGCAGCCTTG 
CTTCCTGTCA AAGGCACGGT AAACGTGGCT GTTTCAGGTG CTTTGCCTTT ATTAGCGGTT 
GGTGGTTCAG GCGTAAATGA GTTAGTGGAT GCTTCTTTAC TAGGCACAAC CACGGTTACT 
TTACCAACTA CCGTTTCAAC ACCTCAAAAT TTATCCAATA ATTTAGATGC TCGTTTTGTA 
GGAACAGTCG TTCAAACAGA TCTTTTAGAC GTTAATTTAT TAGCAACAGC AGACGGTGTA 
TCCAACATTT ATTTTGCTGC AGGCACTACT AGTGAAGTAA CCGCACCAAC AATCACAGGA 
GTAACAGGTA ATTCAACAGC AGGTTACGAA GTTAAAGGAA CTGCCGATGC CAATGCCACG 
GTTGAAATCC GAAATGCAGG AGGCACCGTA ATAGGCACAG GTACCGCTGA TGGGACAGGA 
GCGTTTACAG TTACCGTTCC CGCAGGTGAA GCAGGCGCCA ATGAAACGTT AACCGCCGTA 
GCGAAAAACG CCAGCGGNAC AGAAAGNACG CCAACAACGT TCCAAACNCC AGCGGATGAA 
GCAACCGTAA CCGCACCAAC AATCACAGGA GTGACAGGTA ATTCAACGGC AGGTTACGAA 
GTTAAAGGAA CTGCCGATGC CAATGCCACG GTTGAAATCC GAAATGCAGG AGGCACCGTA 
ATAGGCACAG GTACCGCTGA TGGGACAGGA GCGTTTACAG TTACCGTTCC CGCAGGTGAA 
GCAGGTGCCA ATGAAACGTT AACCGCCGTA GCGAAAAACG CCAGCGGCAC AGAAAGTACG 
CCAACAACGT TCCAAACACC AGCGGATGAA GCAACCGTAA CCGCACCAAC AATCACAGGA 
GTGACAGGTA ATTCAACAGC AGGTTACGAA GTTAAAGGAA CTGCCGAT3C CAATGCCACG 
GTTGAGATCC GAAATGCAGG AGGTGCCGTG ATAGGTACAG GTACTGCTGA TGGGACAGGG 
GCATTTACAG TTACCATTCC CGCAGGTGAA GCAGGTGCGA ATGAAACGTT AACCGCCGTA 
GCGAAAAACG CCAGCGGTAC AGAAAGTACG CCAACAACGT TCCAAACGCC 



EF068-4 (SEQ ID NO:260) 

TSGTIS ASAAVLDIEL LS1STVTSNNDS GTSTSNRWTA ANQNQPVNFT VSGGALADAS 
AVFSGQKQAV LWPPELRGN VAAAGSAAIN TNVTIDLSKV TFLTAVLNAA NDLTNVITQI 
TSGALGNLTG VD I DLTEVNR QLELVNNIEN LGAASFTAPE TLAADGSY IS APISDGLGLV 
LAQNVSNILQ DLNAAVQALE AKGTSIPSNL VAAA I NAALL PVKGTVNVAV SGALPLLAVG 
GSGVNELVDA SLLGTTTVTL PTTVSTPQNL SNNLDARFVG TWQTDLLDV NLLATADGVS 
NIYFAAGTTS EVTAPTITGV TGNSTAGYEV KGTADANATV EIRNAGGTVI GTGTADGTGA 
FTVTVPAGEA GANETLTAVA KNASGTEXTP TTFQTP 

EF069-1 (SEQ ID NO:261) 

TAGGGGAAGC TAATGATCTT GGTATTTATC GTTTATTTTA AAGAAAAGAG GGACGATCAG 
ATGAAAAAGA AAATTG TTG A GGATTTTAAT CGGAAAAGTC AGCATAAAAA ATGGACAAAA 
CGCAAGATGC TTAATTTAGC AATATCAAGT GGTTTATTAT TTACGTCATT AGCAATCCCT 
GTAAGTATAG CTGTTACCTC TGGCACAATC AGTGCATCAG CAGCGGTCTT GGATATCGAA 
CTATTATCAA ATGTTACGTC AAATAATGAC AGTGGCACTT CAACGAGTAA TCGTTGGACA 
GCCGCAAACC AAAATCAACC AGTTAATTTC ACGGTTTCTG GTGGCGCTTT AGCAGATGCT 
TCCGCTGTGT TTAGTGGACA AAAACAAGCG GTGTTAGTGG TTCCTCCTGA GTTAAGAGGA 
AATGTAGCTG CAGCAGGCAG CGCAGCAATC AATACCAATG TCACGATTGA TCTTTCAAAA 
GTTACTTTTT TGACTGCCGT TTTGAATGCA GCC AATGATT TAACCAATGT GATTACTCAA 
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TABLE 1. Nucleotide and Ammo Acid Scqeuences of E. jaecahs <:cnc<. 

A— ACCAGT^ GGGCGTTAGG G AA-TTTAA C T GG7GTTGA7A TTGATTTC AG GGAAGTGAAT 
CGTCAATTGG AATTAGTTAA 7 AA C ATTG AA AAG77A GGTG CTGCTTC A.T TACAGCTCCG 

^ ^rp^.^.^^rr^r ^--nCACCGA TTAGTGA7GG 7 - 7AGGGTTA 

ipm> __ , — , ~ , ,,.^,~^ m -> . — — - --a A~GCGGC-' GT 7CAAGCTTTG 

GAGGCAAAAG GTACCAGTAT CCCAAGTAAT CTT3TC3CCG CAGCTATAAA TGCAGCCTTG 
— CTGTCA AAGGCACGGT AAACGTGGCT GTTTCAGGTG CTTTGCXTT ATTAGCGGTT 
GGTGGT^CAG GCGTAAATGA GTTAGTGGAT GCT7CTTTAC TAGGCACAAC CACGGTTACT 
TTACCAACTA CC 3T7TCAAC ACC7CAAAA7 TTATCCAATA ATTTAGATG 2 TCGTTTTGTA 
GGAACAGTCG TTCAAACAGA TCT77TAGAC GTT AATTT AT TAGCAACA .5C AGACGGTGTA 
T--CAACATTT ATTTTGCTGC AGGCACTACT AGTGAAGTAA CCGCACrAAC AATCACAGGA 
GTAACAGGTA ATTCAACAGC AGGTTACGAA GTTAAAGGAA CTGCCGAT" CAATGCCACG 
GTTGAAATCC GAAATGCAGG AGGCACCGTA ATAGGCACAG GTACCGC7GA TGGGACAGGA 
GCGTTTACAG TTACCGTTCC CGCAGGTGAA GCAGGCGCCA ATGAAACGTT AACCGCCGTA 
GCGAAAAACG CCAGCGGNAC AGAAAGNAC Z CCAACAACGT TCCAAA TC AGCGGATGAA 

gcaaccgtaa ccgcaccaac aatcacagga gtgacaggta attcaa:-g:vC aggttacgaa 

GTTAAAGGAA CTGCCGATGC CAATGCCACG GTTGAAATCC GAAATG!" A "G AGGCACCGTA 
ATAGGCACAG GTACCGCTGA TGGGACAGGA GCGTTTACAG TTACCGTTCC CGCAGGTGAA 
GCAGGTGCCA ATGAAACGTT AACCGCCGTA GCGAAAAACG CCAGCGGCAC AGAAAGTACG 
CCAACAACGT TCCAAACACC AGCGGATGAA GCAACCGTAA CCGCACCAAC AATCACAGGA 
GTGACAGGTA ATTCAACAGC AGGTTACGAA GTTAAAGGAA CTGCCGATGC CAATGCCACG 
GTTGAGATCC GAAATGCAGG AGGTGCCGTG ATAGGTACAG GTACTGC1 GA TGGGACAGGG 
GCATTTACAG TTACCATTCC CGCAGGTGAA GCAGGTGCGA ATGAAACGTT AACCGCCGTA 
GCGAAAAACG CCAGCGGTAC AGAAAGTACG CCAACAACGT TCCAAAX-CC AGCGGATCCT 
AATACGCCCG TGGCGACGCC AATTG TTG AG ACTGTAACAG GTAGTACAAC AAAAGGCTAT 
GAGGTCAAAG GGACTGCTGA AGTTGGCACC ACCATTGAGG TTCGCGATGC AGCTGGCACG 
GTCCTTGGTA CTGCAACAAC TGGAACTGAC GGAAAA7ATA CAGTGACTTT AGATTCAGGA 
ACAGCAACAG CAAATCAAAC GCTGAGCGTT GTAGCGAAAA ACGCTAC7GG CACGGAAAGT 
CAACCAGCAA CGGCGACAAC ACCAGCTGAT GTCACTGCAC CAACAGTTGA TAACATCACA 

„_„ m r,>^mm^^ r^o * A^h<-f-7vr« irTTT't i" u P AACAATCGAA 

GGCAACTCTb Ui l'Lb^llrt ib/inniirttn vjr^r^^rtvj^rt^ - ~ - - 

GTTCGTGATC CATCTGGGGC AGTCATTGGT ACAGGTACCT CTGATGXAA TGGTGATTTT 
ACTGTAACGC TACCAACGGG AACGACCAAT CCTGGGGATA CGTTAAXGT GATTGGAAAG 
GATAACGCGG GAAATGAAAG TCAACCGACT GAAGTCCTTG TTCCTGCTGA TGCCACGGTT 
ACAGCACCAA CTGTAACAGG AGTAACAGGT AATTCAGTTG CTGGTTA . CA GGTGACAGGC 
ACCGCTGATC CGAATGCTAC CATCGAAATT CGTGATGCAG ATGGGAA.TGT GATTGCAACA 
GGGACTGCCG ATGGGACTGG TTCCTTTGCT GTGAACCTTC CAGCTGG'JAC GGCAAATGCG 
AATGAAACAT TGACAGCGTT AGCCAAAGAT CCTGCTGGCA ATACAAGTAC ACCGACAACC 
TTCCAAACAC CAGCAGATGA AGTAGTGGCA CCGCCAAGTG TCGACAAAGT TACTGGGAAT 
ACAACACAAG GATATCAAGT GACAGGTACC GCTGAACTTG GCACCAC7AT TG AAGTTC GT 
GCAACAGACG GAACAGTTTT AGGCACCGCA ACAACTGGAC CGACTGG -CA ATATACTGTG 
ACGTTAGCTT CAGGAAAAGC AACAGCTAAA CAAACAGTGA ATGTAGT7GC TAAAAATGAT 
ACTGGACTTG AGAGTCAACC AACTACAGCT ATGACACCCG CTGATGT IAC CACACCAACA 
ATTGGTGACA TTACTGGAGA TTCAACAACT GGTTATGAAA TCACTGG^AC GGCGGACCCT 
AATACCACCA TTGAAGTACG GAACCCAGAT GGAACAATTA TTGGTAC AAC GACAACGGAT 
GATCAAGGAA ACTTTACTGT GGACCTTCCA GCGGGAGCCG CTAATCCT3G TGATACATTA 
ACAGTTGTTG GAAAAGACGG TGACGGCAAT GAAAGTCAAC CAACGGAAGT GACGGTCCCT 
GAAGATGCAA CCGTAGCAGC ACCAACTGTG ACGACTGTTA CAGGAACAAC TGCCACTGGG 
TATCAAGTAA CCGGCACGGC AGAGCCAAAT GTCACCATTG AGATTCACAA TGAAGCAGGT 
TTAGTTATTG CTACGGGAAC GACTGATGGT GCTGGCGCAT TTACAATCAC TCTTCCGACG 
GGCACAGCAA CAGCTAACGA AGCCTTAACT GCCATTGCGA AAGATGCTGC TGGGAAAGAA 
AGTAATCCGA CTGCTTTCAA AACACCTGCT GATCCAGATG CACCAGTCGC GACACCTACT 
GTTGACAAAA TCACTGGTAG CACGACAAAC GGCTATCAAG TAGTAGG AG C AGCAGAAGTT 
GGTACAACAG TTGAGGTGCG TGACGCCGAT GGCACAGTCC TTGGCA7GGC AACTACTGGA 
ACTGATGGCA AATACACAGT GACTTTAGAG CCAGGGAAGG CCTCAGCTAA CGAAACAATA 
ACTGTCGTAG CGAAAAATGC AACAGGAAAA GAAAGTCAGC CAGCTACAGC AACTACACCA 
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GTCGACTTAG CCACACCAAC CATTGATTCT ATTACCGGAA ATTC TAG T A A AGGTTACGAA 
ATCACTGGAA CGGCGGAGCC AAAAACCACT ATTGATGTCC GTGACGCAGA CGGAACCATC 
ATTGCTGCTA CAACTGCTAA CGAAACCGGC CAATATACGG TGACTCTA : Z AGCTGGCGTA 
GTGACACCAG GAGAAACGAT TACGATTATT AGCAAAGATG GCGCAGGTAA TGAAAGTCAA 
CCAGCTACAG CCGTTATTCC AGCGGATGTT GTTTTAGCGG CGCCAACTAT TACGAAGGTT 
GAAGGAAACA AAGCCAATGG CTATACAGTC ACTGGAACTG CTGATCCA/^A TGTCACGGTT 
CAATTTTACA ATAGCAGTGA ACAATTATTG GCAAGTGGCA ATACAACTA Z TGGAGGTACC 
TTCTCCGTTC ATATTGCAGC AGGGTTAGCA ACAGAAAAAG AAACGTTAA Z CGCACTAACC 
ACAGATACAC AAGGAAATGT GAGTCCTAAA ACCACATTTA TGACGCCAGC CGATATTACG 
GGAGAACCAG AGATTAAAAT TGCGGCACCA ACTGTTTCTT CAGTTTTA'^ AACGTCTAAA 
GCCGGCTACC TCATCAAAGG AACAGCTGAA CCAAACCGAA TC ATTC AAA T TAGTAACCGA 
CTATTAAGAA GTGTGATTGC TGTAGGTGCC ACCGATGCTG AAGGCAACTT CGCTATCCAA 
TTAACAGCGG GACAAGCGAC TGCTCAACAA AGTTTACTTG CGACAGCTAC CGATGGCGCA 
GGACATTACA GTACGGCTAC AACCTTCATG ACGCCAGCCG ACCCAACGAA TCCTGGAGGA 
GGCAATGGTA ACACTGGCGG AAATAACGGC AATACAGGCG GCAATACAG 3 AAACAATGGC 
GCAACTGGCG GGAATAATGG GAATGGTTCA AACACAGGTT CAAATCCAAA TGGAGGTTCT 
GGTTTAGGCA CAACAGGTTC TGGCTTAGGT TCACTAGGCA ATGGCCTC Z 2 TACAAATGGT 
AGTGGCTACC ACCCTAAACT AAGTAC C ATC AGTTATGGCA CTGGAAATC A CGGGAAAACA 
GGCTACTTAC CTAGCACAGG TGAAAAAGAG TCTTCAGCCG TGACAACAAG TTTGTTTGGC 
GCCTTTGTCG CACTCCTTGC GAGCATGGGA ATCATCAAAC GCAAACGTAA AAACTAG 

EF069-2 (SEQ ID NO:262) 

M KKKIVEDFNR KSQHKKWTKR KMLNLAISSG LLFTSLAIPV 

SIAVTSGTIS ASAAVLDIEL LSNVTSNNDS GTSTSNRWTA ANQNQPVNFT VSGGALADAS 
AVFSGQKQAV LWPPELRGN VAAAGSAAIN TNVTIDLSKV TFLTAVLIIAA NDLTNVITQI 
TSGALGNLTG VDIDLTEVNR QLELVNNIEN LGAASFTAPE TLAADGSYIS APISDGLGLV 
LAQNVSNILQ DLNAAVQALE AKGTSIPSNL VAAAINAALL PVKGTVNVAV SGALPLLAVG 
GSGVNELVDA SLLGTTTVTL PTTVSTPQNL SNNLDARFVG TWQTDLLFV NLLATADGVS 
NIYFAAGTTS EVTAPTITGV TGNSTAGYEV KGTADANATV EIRNAGGT'I GTGTADGTGA 
FTVTVPAGEA GANETLTAVA KNASGTEXTP TTFQTPADEA TVTAPTITCV TGNSTAGYEV 
KGTADANATV EIRNAGGTVI GTGTADGTGA FTVTVPAGEA GANETLTAVA KNASGTESTP 
TTFQTPADEA TVTAPTITGV TGNSTAGYEV KGTADANATV EIRNAGGA V I GTGTADGTGA 
FTVTIPAGEA GANETLTAVA KNASGTESTP TTFQTPADPN TPVATPIVKT VTGSTTKGYE 
VKGTAEVGTT IEVRDAAGTV LGTATTGTDG KYTVTLDSGT ATANQTLSW AKNASGTESQ 
PATATTPADV TAPTVDNITG NSGSGYEITG TADPNTTIEV RDPSGAVIGT GTSDANGDFT 
VTLPTGTTNP GDTLTVIGKD NAGNESQPTE VLVPADATVT APTVTGVT-N SVAGYQVTGT 
ADPNATIEIR DADGNVIATG TADGTGSFAV NL PAG TAN AN ETLTALAKDP AGNTSTPTTF 
QTPADEWAP PSVDKVTGNT TQGYQVTGTA ELGTTIEVRA TDGTVLGTAT TGPTGQYTVT 
LASGKATAKQ TVNWAKNDT GLESQPTTAM TPADVTTPTI GDITGDSTTG YEITGTADPN 
TTIEVRNPDG TIIGTTTTDD QGNFTVDLPA GAANPGDTLT WGKDGDGNE SQPTEVTVPE 
DATVAAPTVT TVTGTTATGY QVTGTAEPNV TIEIHNEAGL VIATGTTCGA GAFTITLPTG 
TATANEALTA IAKDAAGKES NPTAFKTPAD PDAPVATPTV DKITGSTTMG YQWGAAEVG 
TTVEVRDADG TVLGMATTGT DGKYTVTLEP GKASANETIT WAKNATGKE SQPATATTPV 
DLATPTIDSI TGNSSKGYEI TGTAEPKTTI DVRDADGTI I AATTANET ZQ YTVTLPAGW 
TPGETITIIS KDGAGNESQP ATAVIPADW LAAPTITKVE GNKANG Y T : > T GTADPNVTVQ 
FYNSSEQLLA SGNTTTGGTF SVHIAAGLAT EKETLTALTT DTQGNVSPKT TFMTPADITG 
EPEIKIAAPT VSSVLGTSKA GYLIKGTAEP NRIIQISNRL LRSVIAVGAT DAEGNFAIQL 
TAGQATAQQS LLATATDGAG HYSTATTFMT PADPTNPGGG NGNTGGKNGN TGGNTGNNGA 
TGGNNGNGSN TGSNPNGGSG LGTTGSGLGS LGNGLGTNGS GYHPKLFTIS YGTGNHGKTG 
YLPSTGEKES SAVTTSLFGA FVALLASMG I IKRKRKN 



EF069-3 (SEQ ID NO:263) 
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TABLE 1. Nucleotide and Amino Acid Seqcucnccs of £. facialis Gene.v 



A^j^TGAA ^=G>-iGGTGGGA A. - 1 G,«j-iAGGTT nn^'-o^CG^A 

G^.'GAAAAAGG GGAGCGGTA^C AGAAAGTAGG GCAA.GAACGT ^ GG AAj~. - ~- - — v^G^GGA * ^'v-T 
AATAGGGGGG TGGCGACGCC AATTGTTGAG ACTGTAACAG GTAGTA:AAG AAAAGGCTAT 
GAGGTCAAAG GGAGTGCTGA AGTTGGCACC ACGATTGAGG TTGGCGATGG AGGTGGCACG 
GTCCTTGGTA GTGCAACAAC TG3AAGTGAC GGAAAATATA CAGTGAITTT AGATTCAGGA 
ATAGCAATAG CAAATCAAAC G G TG AG CGTT 3TAGCGAAAA ACGCTAGTGG CAJGGAAAGT 
-AACCAGCAA CGGCGACAAC ACGAGCTGAT GTCACTGTAC CAACAGTTGA TAAGATCACA 
3 3CAACTCTG GTTGGGGTTA TGAAATTACA G3AACAGGAG ACCCTAACAC AATAATCGAA 
GTTCGTGATC CATGTGGGGG AGTGATTGGT A JAGGTA ~CT CTGATGGGAA TGGTGATTTT 
ATTGTAAGGC TATCAACGGG AA UGACCAAT C GTGGGGATA CGTTAACAGT GATTGGAAAG 
GATAAGGGGG GAAATGAAAG TrAACCGACT GAAGTCCTTG TTCCTGGTGA TG:CACGGTT 
A ZAGCACCAA CT3TAACAGG AGTAACAGGT AATTCAGTTG CTGGTTATCA 3 3TGACAGGC 
ATCGCTGATC CGAATGCTAT CATGGAAATT C5TGATGGAG ATGGGAAGGT 2ATTGCAACA 
GGGACTGCCG ATGGGACTG3 TTIGTTTGGT GTGAACCTTC CAGCTGGGAC ^GTAAATGCG 
AATGAAAGAT TGAGAGCGTT AGGGAAAGAT COTGCTGGCA ATAC AAG TAG AGGGACAACC 
TTGGAAACAC CAGCAGATGA AGTAGTGGCA CCGCCAAGTG TCGACAAAGT TAG TGGGAAT 
AGAACACAAG GATATCAAGT GAGAGGTACG GGTGAACTTG GCACCACrAT TGAAGTTCGT 
GGAACAGACG GAA CAGTTTT AGGGACCGGA A "AACTGGAC CGACTGG::A ATATACTGTG 
AGGTTAGGTT GAGGAAAAGG AA "AGCTAAA CAAACAGTGA ATGTAGTTGC TAAAAATGAT 
AGTGGAGTTG AGAGTCAACG AACTACAGCT ATGACACCCG CTGATGTTAC CAGACCAACA 
ATTGGTGACA TTAGTGGAGA TT2AACAACT G2TTATGAAA TCACTGGGAC GGCGGACCCT 
AATACCACCA TTGAAGTACG GAACCCAGAT GGAACAATTA TTGGTACAAC GACAACGGAT 
GATGAAGGAA ACTTTACTGT GGACCTTCCA GCGGGAGCCG CTAATCCTGG TGATACATTA 
AGAGTTGTTG GAAAAGACGG TGACGGCAAT GAAAGTCAAC CAACGGAAGT GACGGTCCGT 
GAAGATGCAA CCGTAGCAGC AGCAACTGTG AGGACTGTTA CAGGAA 



EF069-4 (SEQ ID NO:264) 

AGEA GANETLTAVA KNAfGTEXTP TTFQTPADEA TVTAPTITGV TGK :~ TAGYEV 
FGTADANATV EIRNAGGTVI GTGTADGTGA FTVTVPAGEA GANETLTAVA KNASGTESTP 
TTFQTPADEA TVTAPTITGV TGNSTAGYEV FGTADANATV EIRNAGGAVI GTGTADGTGA 
FTVTIPAGEA GANETLTAVA KNASGTESTP TTFQTPADPN TPVATPIVET VTGSTTKGYE 
VKGTAEVGTT IEVRDAAGTV LGTATTGTDG KYTVTLDSGT ATANQTL£'v"V AKNASGTESQ 
PATATTPADV TAPTVDNITG IISGSGYEITG TADPNTTIEV RDPSGAVIGT GTSDANGDFT 
VTLPTGTTNP GDTLTVIGKD NAGNESQPTE VLVPADATVT APTVTGVTGN SVAGYQVTGT 
ADPNATIEIR DADGNVIATG TADGTGSFAV NL PAG TAN AN ETLTALAKDP AGNTSTPTTF 
QTPADEWAP PSVDKVTGHT TQGYQVTGTA ELGTTIEVRA TDGTVLGTAT TGPTGQYTVT 
LASGKATAKQ T^WVAKNDT GLESQPTTAM TPADVTTPTI GDITGDSTTG YEITGTADPN 
TTIEVRNPDG TIIGTTTTDD QGNFTVDLPA GAANPGDTLT WGKDGDCNE SQPTEVTVPE 
DATVAAPTVT TVTGT 

EF070-1 ( SEQ ID NO:265) 

TAGGGGAAGC TAATGATCTT GGTATTTATC GTTTATTTTA AAGAAAAGAG GGACGATCAG 
ATGAAAAAGA AAATTGTTGA GGATTTTAAT CGGAAAAGTC AG C ATAAAAA ATGGACAAAA 
CGCAAGATGC TTAATTTAGC AATATCAAGT GGTTTATTAT TTACGTCATT AGCAATCCCT 
GTAAGTATAG CTGTTACCTC TGGCACAATG AGTGCATCAG CAGCGGTCTT GGATATCGAA 
CTATTATCAA ATGTTACGTC AAATAATGAC AGTGGCACTT CAACGAGTAA TGGTTGGACA 
GGCGCAAACC AAAATCAACC AGTTAATTTC ACGGTTTCTG GTGGCGCTTT AGCAGATGCT 
TCCGCTGTGT TTAGTGGACA AAAACAAGCG GTGTTAGTGG TTCCTCCTGA GTTAAGAGGA 
AATGTAGCTG CAGCAGGCAG CGCAGCAATC AATACCAATG TCACGATTGA TCTTTCAAAA 
GTTACTTTTT TGACTGCCGT TTTGAATGGA GCCAATGATT TAACCAATGT GATTACTCAA 
ATTACCAGTG GGGCGTTAGG GAATTTAACT GGTGTTGATA TTGATTTGAG GGAAGTGAAT 
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TABLE 1. Nucleotide and Amino Acid Seqeuences of E. faecalis Genes. 

CGTCAATTGG AATTAGTTAA TAACATTGAA AACTTAGGTG CTGCTTCATT TACAGCTCCG 
GAAACGTTAG CAGCTGACGG CTCATACATT AGTGCACCGA TTAGTGATGG TTTAGGGTTA 
GTTTTAGCCC AAAATGTTTC AAACATCTTA CAAGATTTGA ATGCGGCAGT TCAAGCTTTG 
GAGGCAAAAG GTACCAGTAT CCCAAGTAAT CTTGTCGCCG CAGCTATAAA TGCAGCCTTG 
CTTCCTGTCA AAGGCACGGT AAACGTGGCT GTTTCAGGTG CTTTGCCTTT ATTAGCGGTT 
GGTGGTTCAG GCGTAAATGA GTTAGTGGAT GCTTCTTTAC TAGGCACAAC CACGGTTACT 
TTACCAACTA CCGTTTCAAC ACCTCAAAAT TTATCCAATA ATTTAGATGC TCGTTTTGTA 
GGAACAGTCG TTCAAACAGA TCTTTTAGAC GTTAATTTAT TAGCAACAGC AGACGGTGTA 
TCCAACATTT ATTTTGCTGC AGGCACTACT AGTGAAGTAA CCGCACCAA: AATCACAGGA 
GTAACAGGTA ATTCAACAGC AGGTTACGAA GTTAAAGGAA CTGCCGATG: CAATGCCACG 
GTTGAAATCC GAAATGCAGG AGGCACCGTA ATAGGCACAG GTACCGC7GA TGGGACAGGA 
GCGTTTACAG TTACCGTTCC CGCAGGTGAA GCAGGCGCCA ATGAAACGTT AACCGCCGTA 
GCGAAAAACG CCAGCGGNAC AGAAAGNACG CCAACAACGT TCCAAACIICC AGCGGATGAA 
GCAACCGTAA CCGCACCAAC AATCACAGGA GTGACAGGTA ATTCAACG Z C AGGTTACGAA 
GTTAAAGGAA CTGCCGATGC CAATGCCACG GTTGAAATCC GAAATGCAGG AGGCACCGTA 
ATAGGCACAG GTACCGCTGA TGGGACAGGA GCGTTTACAG TTACCGTTCC CGCAGGTGAA 
GCAGGTGCCA ATGAAACGTT AACCGCCGTA GCGAAAAACG CCAGCGGCAC AGAAAGTACG 
CCAACAACGT TCCAAACACC AGCGGATGAA GCAACCGTAA CCGCACCAAC AATCACAGGA 
GTGACAGGTA ATTCAACAGC AGGTTACGAA GTTAAAGGAA CTGCCGATG Z CAATGCCACG 
GTTGAGATCC GAAATGCAGG AGGTGCCGTG ATAGGTACAG GTACTGCTGA TGGGACAGGG 
GCATTTACAG TTACCATTCC CGCAGGTGAA GCAGGTGCGA ATGAAACGTT AACCGCCGTA 
GCGAAAAACG CCAGCGGTAC AGAAAGTACG CCAACAACGT TCCAAACCCC AGCGGATCCT 
AATACGCCCG TGGCGACGCC AATTGTTGAG ACTGTAACAG GTAGTACAAC AAAAGG C TAT 
GAGGTCAAAG GGACTGCTGA AGTTGGCACC ACCATTGAGG TTCGCGATGC AGCTGGCACG 
GTCCTTGGTA CTGCAACAAC TGGAACTGAC GGAAAATATA CAGTGAC'ITT AGATTCAGGA 
ACAGCAACAG CAAATCAAAC GCTGAGCGTT G TAG C G AAAA ACGCTAG-GG CACGGAAAGT 
CAACCAGCAA CGGCGACAAC ACCAGCTGAT GTCACTGCAC CAACAGTTGA TAACATCACA 
GGCAACTCTG GTTCGGGTTA TGAAATTACA GGAACAGCAG ACCCTAACAC AACAATCGAA 
GTTCGTGATC CATCTGGGGC AGTCATTGGT ACAGGTACCT CTGATGCGAA TGGTGATTTT 
ACTGTAACGC TACCAACGGG AACGACCAAT CCTGGGGATA CGTTAACACT GATTGGAAAG 
GATAACGCGG GAAATGAAAG TCAACCGACT GAAGTCCTTG TTCCTGCTCA TGCCACGGTT 
ACAGCACCAA CTGTAACAGG AGTAACAGGT AATTCAGTTG CTGGTTATCA GGTGACAGGC 
ACCGCTGATC CGAATGCTAC CATCGAAATT CGTGATGCAG ATGGGAACCT GATTGCAACA 
GGGACTGCCG ATGGGACTGG TTCCTTTGCT GTGAACCTTC CAGCTGG:AAC GGCAAATGCG 
AATGAAACAT TGACAGCGTT AGCCAAAGAT CCTGCTGGCA ATACAAGTAC ACCGACAACC 
TTCCAAACAC CAGCAGATGA AGTAGTGGCA CCGCCAAGTG TCGACAAAGT TACTGGGAAT 
ACAACACAAG GATATCAAGT GACAGGTACC GCTGAACTTG GCACCAC CAT TGAAGTTCGT 
GCAACAGACG GAACAGTTTT AGGCACCGCA ACAACTGGAC CGACTGGOCA ATATACTGTG 
ACGTTAGCTT CAGGAAAAGC AACAGCTAAA CAAACAGTGA ATGTAGTTGC TAAAAATGAT 
ACTGGACTTG AGAGTCAACC AACTACAGCT ATGACACCCG CTGATGTTAC CACACCAACA 
ATTGGTGACA TTACTGGAGA TTCAACAACT GGTTATGAAA TCACTGGGAC GGCGGACCCT 
AATACCACCA TTGAAGTACG GAACCCAGAT GGAACAATTA TTGGTACAAC GACAACGGAT 
GATCAAGGAA ACTTTACTGT GGACCTTCCA GCGGGAGCCG CTAATCCTGG TGATACATTA 
ACAGTTGTTG GAAAAGACGG TGACGGCAAT GAAAGTCAAC CAACGGAAGT GACGGTCCCT 
GAAGATGCAA CCGTAGCAGC ACCAACTGTG ACGACTGTTA CAGGAACAAC TGCCACTGGG 
TATCAAGTAA CCGGCACGGC AGAGCCAAAT GTCACCATTG AGATTCACAA TGAAGCAGGT 
TTAGTTATTG CTACGGGAAC GACTGATGGT GCTGGCGCAT TTACAATCAC TCTTCCGACG 
GGCACAGCAA CAGCTAACGA AGCCTTAACT GCCATTGCGA AAGATGCT.^C TGGGAAAGAA 
AGTAATCCGA CTGCTTTCAA AACACCTGCT GATCCAGATG CACCAGTCGC GACACCTACT 
GTTGACAAAA TCACTGGTAG CACGACAAAC GGCTATCAAG TAGTAGGA ZC AGCAGAAGTT 
GGTACAACAG TTGAGGTGCG TGACGCCGAT GGCACAGTCC TTGGCATGGC AACTACTGGA 
ACTGATGGCA AATACACAGT GACTTTAGAG CCAGGGAAGG CCTCAGCCAA CGAAACAATA 
ACTGTCGTAG CGAAAAATGC AACAGGAAAA GAAAGTCAGC CAGCTACAGC AACTACACCA 
GTCGACTTAG CCACACCAAC CATTGATTCT ATTACCGGAA ATTCTAGTAA AGGTTACGAA 
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A7CAC7GGAA CG3CGGAGCC AAA-AACCA7C A77GA7GTCC GTGACGCAGA CGGAACCATC 
A7TGC7G3TA CAA3TGCTAA. C3AAACCGGC CAA7A7ACGG TGACTC. ./ AGCTGGCGTA 
37GACAC 3 AG GAGAAACGA7 CACGAT7AT7 A3 3AAAGATG GCGCAGG7AA 7GAAAGTCAA 
:CA3CTA:AG CCG7TAT7CC AGCGGATGT7 G77TTAGCGG CGCCAAC7A7 7ACGAAGGTT 
3AAG GAAACA. AAG3CAA7G3 G 7A7AC AG73 AC7GGAA37G CTGATC _ 7G7CACGGTT 
GAATT77ACA A7A3CAG7GA A:AATTA7TG GCAAG7G3C A ATACAA'.:7A:: 733AGG7ACC 
77CTCCG77C A.7A77GCAGC A 3GG 77 AG 2 A A-iGAAAA/vj AA-M.CGT - _ > — ? - Al. ^ /\A^ c 
A7A3A7ACAC AAG3AAATGT GAGTCCTAAA A3CACAT7TA TGACGC7AG3 C 3ATATTACG 
G j AG AACC AG AGA7TAAAA7 7GCGGCACGA AZTGTTTCTT CAGTTTCAGG AAGGTCTAAA 
37C3GCTACC T CATC AAA 3G AACAG CTGAA CJAAAC3GAA TCATTCAAA 7 7AG7AACCGA 
C7A77AAGAA GT3TGATT7C T3TA3GTGCC A 3CGAT3CTG AAGGCAAC 77 C GCTATCCAA 
77AACAGCGG GACAAGCGAC T3CTCAACAA A3TTTACTTG CGACAGCTA ; C3ATGGCGCA 
G3ACATTACA GTACGGCTAC AACCTTCA7G A73CCA3CCG ACCCAACCA_-\ 7GCTGGAGGA 
G 3CAATGGTA ACA3TGGC3G AAATAACGGC AA7ACAGGCG GCAATACAC - AAACAATGGC 
GCAACTGGCG GGAATAAT3G GAATGGT7CA AACACA3GTT CAAATC 7*777 7GGAGGTTCT 
G3TTTAGGCA CAA IAGGTTC TGGCTTAGGT T.:ACTAGGCA ATGGCCTf.G3 7ACAAATGGT 
AGTGGC7ACC ACCCTAAA3T AAGTACCATC A 37TATGGCA CTGGAAAT3A C3GGAAAACA 
GGCTACTTAC CTAGCACA3G TGAAAAAGAG TC7TCAGCCG TGACAACA-A3 77TGTTTGGC 
GCCTTTG7CG CACTCCTT3C GAGCATGGGA A7CATCAAAC GCAAACGTAA A A AC TAG 

EFC70-2 (SEC 2 D NO:266) 

K KKKIVEDFNF. FSQHKKWTKF FKLNLAISSG LLFTSLAIPV 

SIAVTSGTIS ASAAVLDIEL LSNVTSNNDS GTSTSKRWTA ANQNQPVNFT VSGGALADAS 
AVFSGQKQAV 7WPPELRGN VAAAGSAAIN TNVTIDLSKV TFLTAVLI7AA 7TLTNVITQI 
TSGALGNLTG VC3 DLTEVNR C.LELVNNIEN LGAASFTAPE 7LAADGS7 1 3 Ar I SDGLGLV 
LAQNVSNILQ DLNAAVQALE AKGTSIPSNL VAAAINAALL PVKGTVNV AV SGALPLLAVG 
GSGVNELVDA SLLGTTTVTL FTTVSTPQNL S NNLDARFVG TWQTCLLC 7 KLLATADGVS 
NIYFAAGTTS EVTAPTITGV TGNSTAGYEV EGTADANATV EIRNAGGTVI G7GTADGTGA 
FTVTVPAGEA GANETLTAVA FNASGTEXTP TTFQTPADEA TVTAPTITGV TGNSTAGYEV 
FGTADAHATV EIRNAGGTVI GTGTADG7GA F7VTVPAGEA GANETLTAVA KNASGTESTP 
TTFQTPADEA TVTAPTITGV TGNSTAGYEV EGTADANATV EIRNAGGAV I G7GTADGTGA 
FTVTIPAGEA GANETLTAVA KNASGTESTP 7TFQTPADPN TPVATPIVET V7GSTTKGYE 
VKGTAEVGTT IEVRDAAGTV LGTATTGTDG FYTVTLDSGT ATANQTL S*~7 AENASGTESQ 
PATATTPADV TAPTVDNITG NSGSGYEITG 7ADPNTTIEV RDPSGAV IGT GTSDANGDFT 
VTLPTGTTNP GDTLTVIGKD NAGNESQPTE VLVPADATVT APTVTGVTGIC SVAGYQVTGT 
ADPNATIEIR DADGNVIATG TADGTGSFAV KLPAGTANAN ETLTALAKD? AGNTSTPTTF 
QTPADEWAP PSVDKVTGNT TQGYQVTGTA E3GTTIEVRA TDGTVLGTAT 7GPTGQYTVT 
LASGKATAKQ TVHWAKNDT GLESQPTTAM TPADV7TPTI GDITGDSTTG YEITGTADPN 



TTIEVRNPDG TIIGTTTTDD QGNFTVDLPA GAANPGDTLT WGKDGDG7G SQPTEVTVPE 
DATVAAPTVT TVTGTTATGY QVTGTAEPNV TIEIHNEAGL VIATGTTDGA GAFTITLPTG 
TATANEALTA IAKDAAGKES NPTAFKTPAD PDAPVATPTV DKITGSTT7 3 YQWGAAEVG 
TTVEVRDADG TVLGMATTGT DGKYTVTLEP GKASANETIT WAKNAT 3 ?' E SQPATATTPV 
DLATPTIDSI TGNSSKGYEI TGTAEPKTTI DVRDADGTII AATTANETGQ YTVTLPAGW 
TPGETITIIS KDGAGNESQP ATAVIPADW LAAPTITKVE GNKANGY7V7 GTADPNVTVQ 
FYNSSEQLLA SGNTTTGGTF SVHIAAGLAT KKETLTALTT DTQGNVSPF7 TFMTPADITG 
EPEIKIAAPT VSSVLGTSKA GYLIKGTAEP NRIIQISNRL LRSVIAVGAT DAEGNFAIQL 
TAGQATAQQS LLATATDGAG HYSTATTFMT PADPTNPGGG NGNTGGN7G:: 7GGNTGNNGA 
TGGNNGNGSN TGSNPNGGSG LGTTGSGLGS LGNGLGTNGS GYHPKLSTIS YGTGNHGKTG 
YLPSTGEKES SAVTTSLFGA FVALLASMGI IKRKRKN 



EF070-3 (SEQ ID KO:267) 



CGG TGACGGCAAT GAAAGTCAAC CAACGGAAGT GACGGTCCCT 
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GAAGATGCAA CCGTAGCAGC ACCAACTGTG ACGACTGTTA CAGGAACAAC TGCCACTGGG 

TATCAAGTAA CCGGCACGGC AGAGCCAAAT GTCACCATTG AGATTCACAA TGAAGCAGGT 

TTAGTTATTG CTACGGGAAC GACTGATGGT GCTGGCGCAT TTACAATCAC TCTTCCGACG 

GGCACAGCAA CAGCTAACGA AGCCTTAACT GCCATTGCGA AAGATGCTGC TGGGAAAGAA 

AGTAATCCGA CTGCTTTCAA AACACCTGCT GATCCAGATG CACCAGTCGC GACACCTACT 

GTTGACAAAA TCACTGGTAG CACGACAAAC GGCTATCAAG TAG T AG G A G 1 J AGCAGAAGTT 

GGTACAACAG TTGAGGTGCG TGACGCCGAT GGCACAGTCC TTGGCATGGC AACTACTGGA 

ACTGATGGCA AATACACAGT GACTTTAGAG CCAGGGAAGG CCTCAGCTAA CGAAACAATA 

ACTGTCGTAG CGAAAAATGC AACAGGAAAA GAAAGTCAGC CAGCTACAGC AACTACACCA 

GTCGACTTAG CCACACCAAC CATTGATTCT ATTACCGGAA ATTCTAGTAA AGGTTACGAA 

ATCACTGGAA CGGCGGAGCC AAAAACCACT ATTGATGTCC GTGACGCAGA CGGAACCATC 

ATTGCTGCTA CAACTGCTAA CGAAACCGGC CAATATACGG TGACTCTA:C AGCTGGCGTA 

GTGACACCAG GAGAAACGAT TACGATTATT AGCAAAGATG GCGCAGGTAA TGAAAGTCAA 

CCAGCTACAG CCGTTATTCC AGCGGATGTT GTTTTAG CGG CGCCAACTAT TACGAAGGTT 

GAAGGAAACA AAGCCAATGG CTATACAGTC ACTGGAACTG CTGATCCAAA TGTCACGGTT 

CAATTTTACA ATAGCAGTGA ACAATTATTG GCAAGTGGCA ATACAACTAC TGGAGGTACC 

TTCTCCGTTC ATATTGCAGC AGGGTTAGCA ACAGAAAAAG AAACGTTAAC CGCACTAACC 

ACAGATACAC AAGGAAATGT GAGTCCTAAA ACCACATTTA TGACGCCAGC CGATATTACG 

GGAGAACCAG AGATTAAAAT TGCGGCACCA ACTGTTTCTT CAGTTTTAGG AACGTCTAAA 

GCCGGCTACC TCATCAAAGG AACAGCTGAA CCAAACCGAA TCATTCAAAT TAGTAACCGA 

CTATTAAGAA GTGTGATTGC TGTAGGTGCC ACCGATGCTG AAGGCAACTT CGCTATCCAA 

TTAACAGCGG GACAAGCGAC TGCTCAACAA AGTTTACTTG CGACAGCTAC CGATGGCGCA 

GGACATTACA GTACGGCTAC AACCTTCATG ACGCCAGCCG ACCCAACGAA TCCTGGAGGA 

GGCAATGGTA ACACTGGCGG AAATAACGGC AATACAGGCG GCAATACA ";G AAACAATGGC 

GCAACTGGCG GGAATAATGG GAATGGTTCA AACACAGGTT CAAATCCAAA TGGAGGTTCT 

GGTTTAGGCA CAACAGGTTC TGGCTTAGGT TCACTAGGCA ATGGCCTC- A5 TACAAATGGT 

AGTGGCTACC ACCCTAAACT AAGTACCATC AGTTATGGCA CTGGAAATAA CGGGAAAACA 
GGCTACT 



EF70-4 (SEQ ID NO:268) 
DGDGNE SQPTEVTVPE 

DATVAAPTVT TVTGTTATGY QVTGTAEPNV 
TATANEALTA IAKDAAGKES NPTAFKTPAD 
TTVEVRDADG TVLGMATTGT DGKYTVTLEP 
DLATPTIDSI TGNSSKGYEI TGTAEPKTTI 
TPGETITIIS KDGAGNESQP ATAV I PAD W 
FYNSSEQLLA SGNTTTGGTF SVHIAAGLAT 
EPEIKIAAPT VSSVLGTSKA GYLIKGTAEP 
TAGQATAQQS LLATATDGAG HYSTATTFMT 
TGGNNGNGSN TGSNPNGGSG LGTTGSGLGS 
YL 



TIEIHNEAGL VIATGTTD3A GAFTITLPTG 

PDAPVATPTV DKITGSTTIJG YQWGAAEVG 

GKASANETIT WAKNATGKE SQPATATTPV 

DVRDADGTII AATTANET3 2 YTVTLPAGW 

LAAPTITKVE GNKANGYT r T GTADPNVTVQ 

EKETLTALTT DTQGNVSPKT TFMTPADITG 

NRI IQISNRL LRSVIAVGA? DAEGNFAIQL 

PADPTNPGGG NGNTGGNNGNT TGGNTGNNGA 

LGNGLGTNGS GYHPKLST IS YGTGNHGKTG 



EF071-1 ( SEQ ID NO:269) 

TAAGTAGAAG TGGTCGGGAC AAACGTAGAA 
GTCCCGCCAT TTATCTGCAG GTTTAAGCCG 
ATGGCTTTTT TAAGAAAGGA GCATGCTATG 
GTGATTGGTT TAAGTTTAAC GATTCCGATG 
CCAATCAACT TTACTTATTT TCCCGGCTCT 
TCTGGAAACG AGCGGAACCT AGGACCACAC 
CGAAATTGGT CAAATGCTTA TGTCTCATAT 



CTTTCGCTGA TTGCCGAA 3A AATTACTTCT 
TGGAAGGGAA GTTATTTToA CTTTCCTTTC 
TTTAAAAAAT TAATGATTTA ACTTGCTTTA 
ACGGCTTNCG CTTACACCAT CGAAGCGGAT 
GCAAGCAATG AATTAATTGT TTTACATGAA 
AGTTTAGACA ATGAAGTG CTATATGAAA 
TTTGTCGGAT CTGGTGGA.:G AGTGAAACAA 
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G7AGG7777T TAG77AA77A AAAAGCCTAT 

GTGGCGACA.T _ Trt-AAAj-*-_-w-.--. _ - .--.TGCTGCC 

AA7ATTGGTG 7TGATTT77 7 77TGGACGAT 

AAAATTGG7G GGGAGATCAT 

GGA77AGTAA AGCGCAT777; 7A77JAGATTT 
GGT7A.T7A.TT CAo_C/-iGG _ 



EFG71-2 { SEC 2D NC:270; 
MF KKLMIQLALV 

I3LSLTIPMT AXAY7IEADP INFTYFPGSA SNELIVLKES GNERNLG??-: 
NWSNAYVSYF VGSGGRVKQL APAGQIQYGA GSLANQ-KAYA QIELARTLC 
VMLARDLAQN IGADFSLDOG TGYGIV7HDV; 1 7X!7,\7\7DKT 7PYGYLA77 
TGVSXTGETG HYSAR 

EF071-3 { SEQ 2D NO:271) 

G T77AAAAAAT TAA7GA77CA ACTTGC777A 

GTGATTGGTT TAAG7TTAAC GATTCCGA7G ACGGCTTNCG CTTACAC 7 A 7 CGAAGCGGAT 

CCAATCAACT TTAC7TATTT TCC7GGCTC7 GCAAGCAA7G AATTAAT7G7 TTTACATGAA 

TCTGGAAACG AGCGGAACCT AGGACCACAC AGT7TAGACA ATGAAGTGGC C7ATATGAAA 

CGAAATTGGT CAAA7GCTTA TGTCTCATAT T7TG7CGGAT CTGGTGGA7' G AG7GAAACAA 

TTAGCTCCTG CTGGCCAAAT TCAATATGGC GCAGGTTCTT TAGCTAAT7A AAAAGCCTAT 

GCGCAAATCG AATTGGCTCG AACGAATAAT GCGGCGACAT TTAAAAAiv 5 A TTATGCTGCC 

TATGTTAAT7 TGGCCCGTGA TTTGGCTCAG AACATTGGTG CTGATTT7 77 TCTGGACGAT 

GGAACAGGTT ATGGCA7AG7 CACTCATGAT TGGATTACAA AAAATTGGTG GGGAGATCAT 

ACAGATCCTT ATGGT7ATTT AGCGCGTGGG GGATTAGTAA AGCGCAT7 7G CACNAGATTT 
ACAACGGGCG TTTCNGNAAC AGGTGAGACT GGTCATTATT CAGCCAGG7 

EF071-4 ( SEQ ID NO: 272) 

F KKLMIQLALV 

1GLSLTIPMT AXAYTIEADP INFTYFPGSA SNELIVLHES GNERNLGP'-iS LDNEVAYMKR 
NWSNAYVSYF VGSGGRVKQL APAGQIQYGA GSLANQKAYA QIELARTI 77 A ATFKKDYAAY 
VNLARDLAQN IGADFSLDDG TGYGIVTHDW ITKNWWGDHT DPYGYLARC-G LVKRIGTRFT 
TGVSXTGETG HYSAR 

EF072-1 (SEQ ID NO:273) 

7AATCAATGA AAAACGCACG TTGGTTAAGT A7TTGCGTCA TGCTACTC GC TCTTTTCGGG 
TTTTCACAGC AAGCATTAGC AGAGGCATCG CAAGCAAGCG TTCAAGTTAC GTTGCACAAA 
TTATTGTTCC CTGATGGTCA ATTACCAGAA CAGCAGCAAA ACACAGGGGA AGAGGGAACG 
CTGCTTCAAA ATTATCGGGG CTTAAATGAC G7CACTTATC AAGTCTA7GA TGTGACGGAT 
CCGTTTTATC AGCTTCGTTC TGAAGGAAAA ACGGTCCAAG AGGCACAGCG TCAATTAGCA 
GAAACCGGTG CAACAAATAG AAAACCGATC GCAGAAGATA AAACACAGAC AATAAATGGA 
GAAGATGGAG TGGTTTCTTT TTCATTAGCT AGCAAAGATT CGCAGCAACG AGATAAAGCC 
TATTTATTTG TTGAAGCGGA AGCACCAGAA GTGGTAAAGG AAAAAG 7 TAG CAACCTAGTA 
GTGATTTTGC CTGTTCAAGA TCCACAAGGG CAATCGTTAA CGCATAT7CA TTTATATCCA 
AAAAATGAAG AAAATGCCTA TGACTTACCA CCACTTGAAA AAACGGTACT CGATAAGCAA 
CAAGGCTTTA ATCAAGGAGA GCACATTAAC TATCAGTTAA CGACTCAGAT TCCAGCGAAT 
ATTTTAGGAT ATCAGGAATT CCGTTTGTCA GATAAGGCGG ATACAAC 3TT GACACTTTTA 
CCAGAATCAA TTGAGGTAAA AGTGGCTGGA AAAACAGTTA CTACAGGTTA CACACTGACG 
ACGCAAAAGC ATGGATTTAC GCTTGATTTT TCAATTAAAG ACTTACAAAA CTTTGCAAAT 



7 LDNEVAYMKR 
A ATFKKDYAAY 
7 LVKRIGTRFT 
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CAAACAATGA CTGTGTCGTA TCAAATGCGT TTAGAAAAGA CCGCTGAACC TGACACTGCG 
ATTAACAACG AAGGACAATT AGTCACGGAC AAADATACCT TGACTAAAAG AGCCACAGTT 
CGTACAGGCG GCAAGTCTTT TGTCAAAGTT GATAGTGAAA ATGCGAAAAT CACGTTGCCA 
GAGGCTGTTT TTATCGTCAA AAATCAAGCG GGG2AATACC TCAATGAA/ -'Z AGCAAACGGG 
TATCGTTGGC AAAAAGAAAA AGCATTAGCT AAAAAATTCA CGTCTAATC A AGCCGGTGAA 
TTTTCAGTTA AAGGNNTTAA AAGATGGCCA GTACTTCTTG GAAGAAATC'T CTGCACCAAA 
AGGTTATCTT CTGAATCAAA CAGAAATTCC TTTTACGGTG GGAAAAAATT CTTATGCAAC 
GAACGGACAA CGAACAGCAC CGTTACATGT AATCAATAA 



EF072-2 (SEQ ID NO:274) 

MKNARWLSI CVMLLALFGF SQQALAEASQ ASVQVTLHKL LFPDGQLPEQ QQNTGEEGTL 
LQNYRGLNDV TYQVYDVTDP FYQLRSEGKT VQEAQRQLAE TGATNRKPIA EDKTQTINGE 
DGWSFSLAS KDSQQRDKAY LFVEAEAPEV VKEKASNLW ILPVQDPQGC SLTHIHLYPK 
NEENAYDLPP LEKTVLDKQQ GFNQGEHINY QLTTQIPANI LGYQEFRLSD KADTTLTLLP 
ESIEVKVAGK TVTTGYTLTT QKHGFTLDFS IKDLQNFANQ TMTVSYQHK Li EKTAEPDTAI 
NNEGQLVTDK HTLTKRATVR TGGKSFVKVD SENAKITLPE AVFIVKNQAG EYLNETANGY 
RWQKEKALAK KFTSNQAGEF SVKGXKRWFV LLGRNLCTKR LSSESNRNSF YGGKKFLCNE 
RTTNSTVTCN Q 

EF072-3 (SEQ ID NO:275) 

ATTACCAGAA CAGCAGCAAA AC AC AGG G G A AGAGGGAACG 

CTGCTTCAAA ATTATCGGGG CTTAAATGAC GTCACTTATC AAGTCTATGA TGTGACGGAT 
CCGTTTTATC AGCTTCGTTC TGAAGGAAAA ACGGTCCAAG AGGCACAG ZQ TCAATTAGCA 
GAAACCGGTG CAACAAATAG AAAACCGATC GCAGAAGATA AAACACA^AC AATAAATGGA 
GAAGATGGAG TGGTTTCTTT TTCATTAGCT AGCAAAGATT CGCAGCAACG AGATAAAGCC 
TATTTATTTG TTGAAGCGGA AGCACCAGAA GTGGTAAAGG AAAAAGCTAG CAACCTAGTA 
GTGATTTTGC CTGTTCAAGA TCCACAAGGG CAATCGTTAA CGCATATTC A TTTATATCCA 
AAAAATGAAG AAAATGCCTA TGACTTACCA CCACTTGAAA AAACGGTACT CGATAAGCAA 
CAAGGCTTTA ATCAAGGAGA GCACATTAAC TATCAGTTAA CGACTCAGAT TCCAGCGAAT 
ATTTTAGGAT ATCAGGAATT CCGTTTGTCA GATAAGGCGG ATACAAC ^TT GACACTTTTA 
CCAGAATCAA TTGAGGTAAA AGTGGCTGGA AAAACAGTTA CTACAGGTVA CACACTGACG 
ACGCAAAAGC ATGGATTTAC GCTTGATTTT TCAATTAAAG ACTTACAAAA CTTTGCAAAT 
CAAACAATGA CTGTGTCGTA TCAAATGCGT TTAGAAAAGA CCGCTGAAC- TGACACTGCG 
ATTAACAACG AAGGACAATT AGTCACGGAC AAACATACCT TGACTAAAAG AGCCACAGTT 
CGTACAGGCG GCAAGTCTTT TGTCAAAGTT GATAGTGAAA ATGCGAAAAT CACCTTGCCA 
GAGGCTGTTT TTATCGTCAA AAATCAAGCG GGGGAATACC TCAATGAAAC AGCAAACGGG 
TATCGTTGGC AAAAAGAAAA AGCATTAGCT AAAAAATTCA CGTCTAATC A AGCCGGTGAA 
TTTTCAGTTA AAGGNNTTAA AAGATGGCCA GTACTTCTTG GAAGAAATCT CTGCACCAAA 
AGGTTATCTT CTGAATCAAA CAGAAATTCC TTTTACGGTG GGAAAAAATT CTTATGCAAC 
GAACGGACAA CGAACAGCAC CGTTACATGT A 

EF072-4 ( SEQ ID NO:276) 

QLPEQ QQNTGEEGTL 

LQNYRGLNDV TYQVYDVTDP FYQLRSEGKT VQEAQRQLAE TGATNRKP I A EDKTQTINGE 

DGWSFSLAS KDSQQRDKAY LFVEAEAPEV VKEKASNLW ILPVQDI'QC-Q SLTHIHLYPK 

NEENAYDLPP LEKTVLDKQQ GFNQGEHINY QLTTQIPANI LGYQEFF.LSD KADTTLTLLP 

ESIEVKVAGK TVTTGYTLTT QKHGFTLDFS IKDLQNFANQ TMTVSYQMr.L EKTAEPDTAI 

NNEGQLVTDK HTLTKRATVR TGGKSFVKVD SENAKITLPE AVFIVKNQAG EYLNETANGY 
RWQKEKALAK KFTSNQAGEF SVKGXKRWPV LLGRNLCTKR LSSESNRNSF YGGKKFLCNE 
RTTNSTVTC 
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EFC73-: (sec :d :v 



TAAATGAACA 


AAT ^ AAA TAG 




AT - GG G TA * A 






:;gttaatc 


AT7GCTGTCG 








* * ' - TG o i ~. ^ 




7TTTAGTA 




A. T C G AAAAAA 


G AAAAATG G 


G C C G A C AAAA 


G C G AT C A" ■. _ 




GTTACTTA 






T * ATC^; i ^/-^to 


TTGGGG ^ TAT 


C T C C AC A A _- A 




TTGATTTT 


TTGAGAAGTA 


CAATGAGCA " 


A 3CC AAAAAA 


CAAA.TCATAC 


AATTGCAA17 


aaacatgaat 


CG7TCAACTA 


AATTACGGG 3 


GATTGACTTA 


3GTAATGATA 


CTACGAAGG'I 




CTAAAGCT 


CT3TTTAAAG 


AGTTAGTGAA 


AGAACCTAAA 


.AAGTTACACT 


tagccaat:; 


CTTTCTCTAT 


ACACATTTAC 


CAAATATCGT 


TGACTTAACA 




TAGAAATC:-; 


A 


AAGAC GAA 


GTAAAAAACA 


AACAAACGTA 


TGAAAAAT7A 


G AA G AAA. G C G 


CACAAATCAT 


t;: 


ACCAATTG 


TCAAAATTAG 


TTAAAAATCA 


TTATGAGGAA 


AT 3GTTTCCG 


ATGACTTA3A 




ATTTAGAT 


GTCGAAATGG 


GGATC GCTAA 


AAGCAGCTTG 


T I 3CAAAAAG 


ctggaact:-; 




AATCACCT 


caagtaaacg 


AAGAC GAGCA 


ATAA 










EF073-2 (SEQ ID NO:27* 












mtcklntkll: 


GYILLGALI I 


AVAREYGFFA 


FVILVGFLVF 


vlyrkkk::a.~ 




.SDQMPYLT 


KDKEAKYREL 


GLSPQEIDFF 


RSTMSTAKKQ 


IIQLQENMNR 


STKLRA:~:_r 


. k: 


^TTKVSKAL 


FKELVKEPKK 


LHLANHFLYT 


HLPNI VDLTS 


KKLEISQHEV 


KNKQTYEFGT 


" Eli 


AQIIDQLS 


KLVKNDYEEI 


VSDDLDDLDV 


EMSIAKSSLS 


QKAATEESPQ 


VNEDQQ 







EF073-3 (SEQ ID NO:279) 

CT ATCGAAAAAA GAAAAA TGCC GC CG ACAAAA GCGATCAAAT GCCTTACT7A 
ACGAAAGATA AAGAAGCCCA TTATCGTGAG TTGGGGTTAT CTCCACAAAA A A TTGATTTT 
TTCAGAAGTA CAATGAGCAC AGCCAAAAAA CAAATCATAC AATTGCAAGA AAACATGAAT 
CGTTCAACTA AATTACGGGC GATTGACTTA CGTAATGATA CTACGAAGGT TTGTAAAGCT 
CTGTTTAAAG AGTTAGTGAA AGAACCTAAA AAGTTACACT TAGCCAAT3A CTTTCTCTAT 
ACACATTTAC CAAATATCGT TGACTTAACA AGTAAACATT TAGAAATGGA ACAACACGAA 
GTAAAAAACA AACAAACGTA TGAAAAATTA GAAGAAAGCG CACAAATGAT 7GACCAATTG 
TCAAAATTAG TTAAAAATGA TTATGAGGAA ATCGTTTCCG ATGACTTAGA CG ATTTAGAT 
GTCGAAATGT CGATC GCTAA AAGCAGCTTG TCGCAAAAAG CTGCAACTGA GGAATCACCT 
CAAGTAAACG AAGACCAGCA AT 

EF073-4 t SEQ ID NO:280) 

YRKKKNAA DKSDQMPYLT 

KDKEAHYREL GLSPQEIDFF RSTMSTAKKQ I IQLQENMNR STKLFAI7LR NDTTKVSKAL 
FKELVKEPKK LHLANHFLYT HLPNIVDLTS KKLEIEQHEV KNKQTYEKGE ESAQI IDQLS 
KLVKNDYEEI VSDDLDDLDV EMSIAKSSLS QKAATEESPQ VNEDQQ 

EF074-1 ( SEQ ID NO:281) 

TAAAGGAGTT CTCAAAAAAT GAAGCTAAAA AAAATAATTC CTGCTTTTCC CCTTCTTTCA 
ACCGTTGCAG TTGGCTTGTG GTTAACGCCT ACTCAAGCTT CTGCAGATGC TGCGGATACG 
ATGG TAG ATA TCTCTGGCAA AAAAGTGTTG GTTGGATATT GGCATAAC7G GGCCTCAAAA 
GGACGCGATG GTTACAAACA AGGAACATCA GCATCACTAA ACCTTTCAGA AGTAAATCAA 
GCCTACAATG TCGTACCGGT TTCCTTCATG AAAAGCGATG GCACGACACG GATTCCTACG 
TTCAAGCCTT ATAACCAAAC GGACACTGCC TTCCGACAAG AAGTCGCACA ATTAAATAGT 
CAAGGTCGCG CAGTTTTATT GGCACTTGGT GGAGCAGATG C AC AT ATT C A ATTAGTCAAA 
GGCGATGAAC AAGCCTTTGC GAATGAAATC ATTCGTCAAG TGGAAACATA CGGCTTTGAT 
GGTTTAGACA TCGACTTAGA GCAATTGGCG ATTACTGCTG GCGACAA;":;:A AACCGTCATC 
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CCTGCTACGT TG AAAA TAG T CAAAGACCAT TATCGAGCAC AA G G AAAA A A TTTCATCATT 

ACGATGGCAC CAGAATTCCC TTATTTAAAA CCTGGTGCCG CTTATGAAAC ATACATTACT 

TCCCTAAATG GTTATTATGA TTACATTGCC CCACAATTAT ATAACCAAGG CGGCGACGGT 

GTCTGGGTTG ATGAAGTTAT GACTTGGGTT GCTCAAAGCA ACGATGCTCT AAAATA C GAG 
TTCCTCTATN ATATT 

EF074-2 (SEQ ID NO:282) 

MKLKK IIPAFPLLST VAVGLWLTPT QASADAADTM VDISGKKVLV GYWHNWASKG 
RDGYKQGTSA SLNLSEVNQA YNWPVSFMK SDGTTRIPTF KPYNQTDTAF RQEVAQLNSQ 
GRAVLLALGG ADAHIQLVKG DEQAFANEI I RQVETYGFDG LDIDLEQLAI TAGDNQTVIP 
ATLKIVKDHY RAQGKNFI IT MAPEFPYLKP GAAYETYITS LNGYYDYIAP QLYNQGGDGV 
WVDEVMTWVA QSNDALKYEF LYXI 

EF074-3 (SEQ ID NO:283) 

TGC TGCGGATACG 

ATGGTAGATA TCTCTGGCAA AAAAGTGTTG GTTGGATATT GG C AT AA C TG GGCCTCAAAA 

GGACGCGATG GTTACAAACA AGGAACATCA GCATCACTAA ACCTTTCAGA AGTAAATCAA 

GCCTACAATG TCGTACCGGT TTCCTTCATG AAAAGCGATG GCACGACACG GATTCCTACG 

TTCAAGCCTT ATAACCAAAC GGACACTGCC TTCCGACAAG AAGTCGCACA ATTAAATAGT 

CAAGGTCGCG CAGTTTTATT GGCACTTGGT GGAGCAGATG CACATATTCA ATTAGTCAAA 

GGCGATGAAC AAGCCTTTGC GAATGAAATC ATTCGTCAAG TGGAAACATA CGGCTTTGAT 

GGTTTAGACA TCGACTTAGA GCAATTGGCG ATTACTGCTG GCGACAAC "A AACCGTCATC 

CCTGCTACGT TGAAAATAGT CAAAGACCAT TATCGAGCAC AAGGAAAAAA TTTCATCATT 

ACGATGGCAC CAGAATTCCC TTATTTAAAA CCTGGTGCCG CTTATGAAAC ATACATTACT 

TCCCTAAATG GTTATTATGA TTACATTGCC CCACAATTAT ATAACCAAGG CGGCGACGGT 

GTCTGGGTTG ATGAAGTTAT GACTTGGGTT GCTCAAAGCA ACGATGCTCT AAAATACGAG 
TTCCTCT 

EF074-4 (SEQ ID NO:284) 
AADTM VDISGKKVLV GYWHNWASKG 

RDGYKQGTSA SLNLSEVNQA YNWPVSFMK SDGTTRIPTF KPYNQTDTAF RQEVAQLNSQ 

GRAVLLALGG ADAHIQLVKG DEQAFANEI I RQVETYGFDG LDIDLEQLAI TAGDNQTVIP 

ATLKIVKDHY RAQGKNFI IT MAPEFPYLKP GAAYETYITS LNGYYDYIAP QLYNQGGDGV 
WVDEVMTWVA QSNDALKYEF LY 



EF075-1 (SEQ ID NO:285) 

TAACCTATAA GAAAAAAATC ACAACCTGTG 
GGGAAGAAAA TTTTTGCCAT TATCNTTGGA 
GGAATGGGAG CAAAACTTTA TTGGGATGTT 
GTAGAACGAT CTAAAAAAAG TCAGGTCAAT 
TTATTAGGGA TTGATACAGG CGATGATGGG 
ATTGTTGCAA CAGTTAATCC TCGTGACAAG 
ACCTATGTTG ATATTCCAGG TCAAGGAAAA 
GGTGGCGCAT CTTTAGCAAT GGACACAGTT 
TATGTTTCAA TTAATATGGC TGGTTTAAAA 
GTGAACAATA ATCTGACTTT TTCTCAAGAC 
TTGGATGGTG AACAAGCACT CTCCTATTCA 
TACGGCCGCC AAGAACGTCA AAGAAAAGTG 
CTTAACAGCG TAAGCAACTA TCAAGAAATT 
GATTTAAGTT TTGATGACAT GAAAAAAATT 



ATAAATTATT GGAGGNAAAA TATGTCAAAA 
ATTATCTTGG NTCTATTTCT TGCAGTTGTT 
TCTAAATCAA TG G A T AAAA C CTATGAAACA 
TTAAACAATA AGGAGCCTTT TTCTGTTTTA 
CGTGTCGAGC AAGGTCGTTC GGATACAACA 
CAAACAACCT TAGTCAGTCT TGCTCGCGAT 
CAAGATAAAT TGAATCACGC CTATGCTTTT 
GAAAACTATT TAAACAT^CC TATTAATCAT 
GAATTAGTCA ACGCGGTTGG C GG AATCGAA 
GGATATGATT TTACGATTGG TAAAATTTCA 
AGAATGCGTT ACGAAGACCC TAATGGTGAC 
ATTGAAGGCA TCGTCCAAAA AGTCTTAAGT 
TTAACAGCTG TTTCTGATAA TATGAAGACA 
GCCTTAGATT ATCGCAG7GC CTTTGGTAAA 
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G TG AAA C AA G ACCAACT72A AGGTAC7GG7 7TTA7GGAAG ATGGT3T: 7 2' 77ATCAACGT 
^ -i vjvj ATG/^A 1 ^. ^u-ivj n^.TT A^-iU ^wGTGTCC/iA. '22AA2A3T7GA AAA-A77A_- ■. _ T 7VA.7ACAAAA 
TAA 



EFC75-2 (SEQ IG KC:255; 

MSKG KKTFATIXGI IIXI.FLAWG KGAXLYWD7S KSMDKTYETV 

ERSKKSQVNL NNKEPFSVLL EGI DTGDDGR VEQGRSDTTI VATVKPREEQ TTLVSLARDT 
YVDI PGQGKQ 2KLNHAYAFG GASLAMDTVE NYL/JI PINKY V£ INMAG7FE L7NAVGGIEV 
NNNLTFSQDG YDFTIGKISL DGEQALSYSR MRYEDPNGDY GRQERQRF7I EG IVQKVLSL 
KSVENYQEIL TAVSDNMKTD L3FDDMKKI A LDYRSAFGKV KQDQLQG'" GF M^DGVSYQRV 
DEQELTRVQQ- ELKMQLNTK 



SF075-3 ( SEQ ID NO:287) 



ACTTTA TTGGGATGTT TCTAAATCAA TGGATAAAAC CTATGAAAGA 

GTAGAACGAT CTAAAAAAAG TCAGGTCAAT TTAAACAATA AGGAGCCT7T TTCTGTTTTA 
TTATTAGGGA TTGATACAGG CGATGATGGG CGTGTCGAGC AAGGTCG77G GGATACAACA 
ATTGTTGCAA CAGTTAATCC TCGTGAGAAG CAAACAACCT TAGTCAGT2T 7G7TGGCGAT 
ACCTATGTTG ATATTCCAGG TCAAGGAAAA CAAGATAAAT TGAATC A CGC CTATGCTTTT 
GGTGGCGCAT CTTTAGCAAT GGACACAGTT GAAAACTATT TAAACATA7G TATTAATCAT 
TATGTTTCAA TTAATATGGC TGGTTTAAAA GAATTAGTCA ACGCGGT7GG CGGAATCGAA 
GTGAACAATA ATCTGACTTT TTCTCAAGAC GGATATGATT TTACGAT7GG T.AAAATTTCA 
TTGGATGGTG AACAAGCACT CTCCTATTCA AGAATGCGTT ACGAAGAGSC TAATGGTGAC 
TACGGCCGCC AAGAACGTCA AAGAAAAGTG AT7GAAGGCA TCGTCCAAAA AGTCTTAAGT 
CTTAACAGCG TAAGCAACTA TCAAGAAATT TTAACAGCTG TTTC TG A ~ ' AA TATGAAGACA 
GATTTAAGTT TTGATGACAT GAAAAAAATT GCCTTAGATT ATCGCAGTG7 CTTTGGTAAA 
GTGAAACAAG ACCAACTTCA AGGTACTGGT TTTATGCAAG ATGGTGT77G CTATCAACGT 
GTGGATGAAC AAGAATTAAC TCGTGTCCAA CAAGAGTTGA AAAATCAATT GAATACAAAA 



EF075-4 ( SEQ ID NO: 288) 
KLYWDVS KSMDKTYETV 

ERSKKSQVNL NNKEPFSVLL LGIDTGDDGR 
YVDIPGQGKQ DKLNHAYAFG GASLAMDTVE 
NNNLTFSQDG YDFTIGKISL DGEQALSYSR 
NSVSNYQEIL TAVSDNMKTD LSFDDKKKIA 
DEQELTRVQQ ELKNQLNTK 



VEQGRSDTTI VATVNPREK 2 TTLVSLARDT 

NYLNIPINHY VSINMAGLKE LVNAVGGIEV 

MRYEDPNGDY GRQERQRKV I EG IVQKVLSL 

LDYRSAFGKV KQDQLQGTGF MQDGVSYQRV 



EF076-1 (SEQ ID NO:289) 

TAGAAAATAA CAGAGGAGCT GAAGGAAATG 
AGCATTGCTG CAGTTGCAAG TGTCTCTGTT 
AAGGTATCTC ATGTTTCCAA TCGTTATAAA 
GGAAACCAAA AATTATTATC GATTGTCGAT 
TTAAATGTTG TGGATCGTGT GAAAGATGGC 
GTTAAAGACA ATACAGATTC TTTAAAAGAA 
AAGTTAAAAA AGTGGCCTAG GCCATCTTTT 
TAA 



AAAGCATCAA CAAAAATTGG TATCGGTTTA 
GCAGTCATCG C TT C TG AA AA AATTATTAAG 
GTTAAAAAGT TTGTAGACGA TAAATTTGAT 
GATTTATCCG ATGATGAA7T AGATTCTGTT 
GGTTCAAAAT TAGCTGAATA TGGCGAAAAA 
CGCTTTTTCA CATTTAT7GA ASATGCAATG 
TTTTATAAAA ATAATTC77T TGTTTCAACA 
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EF076-2 (SEQ ID NO : 2 9 0 ) 

MK ASTKIGIGLS IAAVASVSVA VIASEKIIKK VSHVSNRYKV KKFVDDKFDG 
NQKLLSIVDD LSDDELDSVL NWDRVKDGG SKLAEYGEKV KDNTDSLKER FFTFIEDAMK 
LKKWPRPSFF YKNNSFVST 

EF076-3 ( SEQ ID NO;291) 

CATCG CTTCTGAAAA AATTATTAAG 

AAGGTATCTC ATGTTTCCAA TCGTTATAAA GTTAAAAAGT TTGTAGACGA TAAATTTGAT 

GGAAACCAAA AATTATTATC GATTGTCGAT GATTTATCCG ATGATGAATT AGATTCTGTT 

TTAAATGTTG TGGATCGTGT GAAAGATGGC GGTTCAAAAT TAGCTGAATA TGGCGAAAAA 

GTTAAAGACA ATACAGATTC TTTAAAAGAA CGCTTTTTCA CATTTATTGA AGATGCAATG 

AAGTTAAAAA AGTGGCCTAG GCCATCTTTT TTTTATAAAA ATAATTCTT 

EF076-4 (SEQ ID NO:292) 

VIASEKIIKK VSHVSNRYKV KKFVDDKFDG 

NQKLLSIVDD LSDDELDSVL NWDRVKDGG SKLAEYGEKV KDNTDSLKER FFTFIEDAMK 
LKKWPRPSFF YKNNS 

EF077-1 (SEQ ID NO:293) 

TAATGTAAAG TGAATGATGG GAGAGAAAAA GAGATGAAGC ATGTAACAAA ATTGGGGATT 
ACAATTATAA CAGGAGTTTT GGCATTATTA TTTGAATTTA TTTTAC ATCA GCCGAATTGG 
GCGTATGGCA TTATTTTAAT AACAGGTTCT GTAATGGCGT TAATGATGTT CTGGGAAATG 
ATTCAAACCT TACGTGAAGG AAAATATGGT GTCGATATTT TAGCGATTAC CGCTATCGTT 
GCAACCTTAG CTGTGGGAGA ATACTGGGCC AGTTTGATGA TTTTAATTAT GTTGACTGGT 
GGTGATTCAT TAGAAGACTA TGCCGCTGGA AAAGCTAACC AAGAGCTGAA GTCATTATTG 
GATAACTCGC CACAAAAAGC TCATCGCTTG AATGGCGAAA ATTTAGAAGA TGTTTCTGTT 
GAGGAAATCA ATGTTGGCGA TGAATTAGTA GTAAAACCAG GGGAACTAGT TCCAGTTGAT 
GGCTTGGTAA AAACCGGGAC ATCAACAGTC GATGAATCTT CATTAACAGG AGAATCAAAA 
C C AATTG AAA AAAATCCTGG GGATGAATTA ATGTCGGGTT CCGTGAATGG TGACGGCTCT 
TTGAAAATGG TTGCTGAAAA AACTGTAGCA GACAGTCAAT ATCAAACAAT TGTGAACTTA 
GTGAAAGAAT CTGCGGCGCG TCCAGCTCAT TTTGTACGTT TAGCAGATCG CTATGCGGTA 
CCTTTTACAC TAGTTGCCTA CCTAATTGCA GGTGTTGCTT GGTTTGTTTC AAAAAGTCCG 
ACACGTTTTG CGGAAGTCTT AGTTGTTGCT TCGCCGTGTC CTTTAATTCT ATCTGCCCCA 
ATTGCTTTAG TGGCAGGGAT GGGTCGTTCA AGTCGTCATG GGGTCGTTAT TAAATCGGGA 
ACGATGGTCG AAAAATTAGC TTCTGCAAAA ACGATTGCGT TTGATAAAAC AGGCACGATT 
ACGCAAGGAC AACTTTCTGT TGATCAAGTC CAACCAATCA ATGCTGGAAT AACTGCTGCT 
GAATTAGTGG GATTGGCAGC AAGCGTGGAA CAAGAATCAA GTCATATTTT AGC TAG ATCA 
ATTGTTGCTT ATGCCAGAAA GCAAGATGTC CCATTAAAAA ATATTACAGA TCTAGCGGAA 
GTTTCTGGTG CTGGCGTGAA GGCATTTGTG GATGGTGCTG AGATACGGGT AGGTAAAAAG 
AATTTTGTGA CACAAGAGTC TCAAGAAACT GAAAAAATTG ATAAAACGAC TATTCATATT 
TCACGTAATG GCACATATTT AGGCCGAATT AC TTTTAC AG ACACTGTACG CCCAGAAGCA 
AAAGAGACTA TGGAAAAATT ACACCAATTA CATCTTCAAC GAATTTTAAT GCTGACGGGG 
GATCAAGAAT CCGTTGCAGA AACGATTGCT GCAGAAGTAG GAATTACCGA AGTACATGGG 
GAATGTTTAC CACAAGATAA ATTAACTATT CTAAAAGAAT TGCCTAAAGA AAATCATCCA 
GTCATCATGG TAGGAGATGG TGTAAATGAT GCACCTTCGC TTGCTGCTGC AGACGTAGGT 
ATTGCTATGG GTGCTCATGG AGCTACTGCG GCTAGTGAAA CTGCT 3ACGT TGTTATTTTA 
AAAGATGACT TAAGTAAAGT CAGCCAAGCG GTCGAAATTG CCCAAGATAC CATGAAAATT 
GCCAAACAAT CTGTATTAAT CGGAATTTTT ATCTGCGTTT TACTAATGTT AATTGCTAGT 
ACCGGGATCA TTCCGGCGCT AATCGGGGCT ATGCTACAAG AAGTCGTGGA CACTGTGTCA 
ATCTTATCTG CTTTGCGTGC TCGTCGAATT GGCCAGTAA 
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ef c"?-: ;se; ie nc : ) 

:-2<HVTKLGIT IITGYEALEF EFILH^PNWA YGIILITGSV I^L:~* :FWEMI 

QTLREGKYGV LILAITAIVA TLAVGE YWAS LMILIMLTGG CSLELYAAGK ANQELKSLLD 

NSFQKAHRLK GE'vuEEVSVE EI^vGEELYY FPGELYPYD3 LVKTGTSTVE ESSLTGESKP 

IEKNP3DELM SGSVNGDGSL KMVAEKTVAD SQY£TIVNLV KESAAKPArlF VKLADRYAVP 

FTLVAYLIAG VAWFVSKSPT RFAEVLWAS FCPLILSAPI ALVAGK3FGS RHGWIKSGT 

MVEKLASAKT IAFGKTGTIT QGQLSYEQVQ FINAGITAAE LYGLAA5YEQ ESSHILARSI 

VAYARKQDVP LKNITELAEY SGAGVXAFVD GAEIRVGKKN FVT: EFFETE KIDKTTIHIS 

RNGTYLGRIT FTDTVEPEAK ETMEKLHQLH LQ'RI LKLTGD QESVA.r FI AA EVGITEVHGE 

CLPQDKLTIL KELPKENKPV IMVGDGYNDA FSLAAADVGI AMGA-iGATAA S FT AD W ILK 

DDLSKVSQAV H IAQDTMKIA KQSVLIGIFI CYLLMLIAST GIIPAFIGAM LQEWDTVSI 
LSALRARRIG £ 

EF077-3 (SEC ZD NO:295) 

TCA GCCGAATTGG 

GCGTATGGCA TTATTTTAAT AACAGGTTCT GTAATGGCGT TAAT3ATGTT CTGGGAAATG 
ATTCAAACCT TACGTGAAGG AAAATATGGT GTCGATATTT TAG EG ATT AC CGCTATCGTT 
GCAACCTTAG CTGTGGGAGA ATACTGGGCC AGTTTGATGA TTTIAATTAT GTTGACTGGT 
GGTGATTCAT TAGAAGACTA TGCCGCTGGA AAAGCTAACC AAGAGCTGAA GTCATTATTG 
GATAACTCGC CACAAAAAGC TCATCGCTTG AATGGCGAAA ATTTAGAAGA TGTTTCTGTT 
GAGGAAATCA ATGTTGGCGA TGAATTAGTA GTAAAACCAG GGGAA^TAGT TCCAGTTGAT 
GGCTTGGTAA AAACCGGGAC ATCAACAGTC GATGAATCTT CAT1 A A FAGG AGAATCAAAA 
CCAATTGAAA AAAATCCTGG GGATGAATTA ATGTCGGGTT CCGTGAATGG TGACGGCTCT 
TTGAAAATGG TTGCTGAAAA AACTGTAGCA GACAGTCAAT ATCAAACAAT TGTGAACTTA 
GTGAAAGAAT CTGCGGCGCG TCCAGCTCAT TTTGTACGTT TAGC A ^ATC 3 CTATGCGGTA 
CCTTTTACAC TAGTTGCCTA CCTAATTGCA GGTGTTGCTT GGTTTGTTTC AAAAAGTCCG 
ACACGTTTTG CGGAAGTCTT AGTTGTTGCT TCGCCGTGTC CTTTAATTCT ATCTGCCCCA 
ATTGCTTTAG TGGCAGGGAT GGGTCGTTCA AGTCGTCATG GGGTC 'JTTAT TAAATCGGGA 
ACGATGGTCG AAAAATTAGC TTC TGCAAAA ACGATTGCGT TTGATAAAAC AGGCACGATT 
ACGCAAGGAC AACTTTCTGT TGATCAAGTC CAACCAATCA ATGCTFGAAT AACTGCTGCT 
GAATTAGTGG GATTGGCAGC AAGCGTGGAA CAAGAATCAA GTCATATTTT AGCTAGATCA 
ATTGTTGCTT ATGCCAGAAA GCAAGATGTC CCATTAAAAA ATATTACAGA TCTAGCGGAA 
GTTTCTGGTG CTGGCGTGAA GGCATTTGTG GATGGTGCTG AG AT A 3 3GGT AGGTAAAAAG 
AATTTTGTGA CACAAGAGTC TCAAGAAACT GAAAAAATTG ATAAAACGAC TATTCATATT 
TCACGTAATG GCACATATTT AGGCCGAATT ACTTTTACAG AC A C TG T AC G CCCAGAAGCA 
AAAGAGACTA TGGAAAAATT ACACCAATTA CATCTTCAAC GAATTTTAAT GCTGACGGGG 
GATCAAGAAT CCGTTGCAGA AACGATTGCT GCAGAAGTAG GAATTACCGA AGTACATGGG 
GAATGTTTAC CACAAGATAA ATTAACTATT CTAAAAGAAT TGCCTAAAGA AAATCATCCA 
GTCATCATGG TAGGAGATGG TGTAAATGAT GCACCTTCGC TTGCTGCTSC AGACGTAGGT 
ATTGCTATGG GTGCTCATGG AGCTACTGCG GCTAGTGAAA CTG :TGAC3T TGTTATTTTA 
AAAGATGACT TAAGTAAAGT CAGCCAAGCG GTCGAAATTG CCCAA3ATAC CATGAAAATT 
GCCAAACAAT CTGTATTAAT CGGAATTTTT ATCTGCGTTT TACTAAT3TT AATTGCTAGT 
ACCGGGATCA TTCCGGCGCT AATCGGGGCT ATGCTACAAG AAGTC3T3GA CACTGTGTCA 
ATCTTATCTG CTTTGCGTGC TCGTCGAATT GGCC 

EF077-4 (SEQ ID NO:296) 

QPNWA YGIILITGSV MALMMFWEMI 

QTLREGKYGV DILAITAIVA TLAVGE YWAS LMILIMLTGG DSL.ECYAAGK ANQELKSLLD 
NSPQKAHRLN GENLEDVSVE EINVGDELW KPGELVPVDG LVKTGTSTYD ESSLTGESKP 
IEKNPGDELM SGSVNGDGSL KKVAEKTVAD SQYQTIVNLV KESAAFPAHF VRLADRYAVP 
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FTLVAYLIAG VAWFVSKSPT RFAEVLWA9 PCPr Tr «p T M ,„,.,., 
MVEKLAEAKT IAFDKTGTIT QG Q LSv7 Q V Q PlS^E Z^ -^ * HGWIKSGT 
™r ^im-AEV SCAGVKAFvS GAElS™ k S^HIS 

CLPQDKLTIL L= SEEK I S= ~ " 
EIAQDTMKIA K 0 SV LI G 1FI CVLLMLIAST G IIP ^ fS^SSS 

EF079-1 (SEQ ID NO:297) 

TAATTTCTAG CATCACCGAA GAAATTTTTA GAAAAACAAA Gkcr^-rrr ™ 
CCCAGGCTCT CATGCTTTAT TTTTAAGGAG GAAGCAATGA AGTr^^ 

^™ TGAT TCmTACTG ATTATTGGAA S ^ 
TTTGTTAGCG ATGCATTAAA TAACTATCTG GATCAACAAA TTA^CG^TCA TTATCAAPPl 
AAAGCAAGCC AAGAAAACAC CAAAGAAATG GCTGAACTTr Aar- - ^ ~ TTATCAAGCA 

i=s sssss; =s ™ ™ 

= = ~ = ™ sss 

= sssss s= = Sis ™ 

EE s= = <™ riEi 

SSSSS SSE££ So™ -™ 

1TCATTATCT CGTACAAGCG ACGGAAAAAG ACgISS aSa^I f™ C 

EF079-2 (SEQ ID NO:298) 
MKSKKKRRI IDGFMJ^LLI IGIGAFAYPF 

= s= jsssss 0= s= 

ISGHRGLPQA KLFTDLPELK KGDEFYIEVN GKTLAYQVDQ IKT-v^TD'K 

S EST" 0 HRIPICPm M ™« ™Sge 



EF079-3 (SEQ ID NO:299) 
TCCT 



SSSES ££££ SSS 2= II™ ~" 

=i i= s= ™ ™ 

ATTrPAAZiLR rrT, A^m^rr, WI GAAAGTC ATACGATTGG TGTTTTAACC 

-LLLAAAAA TAAATGTCCG TTTACCAATT TTTpa^^t\ 'vML. l. 

esses r™™" ™^ §ssr sssss 

AAAAAAGGCG ££5S£ SSSJ S?™™ "^"^ 

=S ~ ™ 

TTCATTATCT GGTACAAGCG ACGGAAAAAG ACgSS SccU ™™<»* 



EF079-4 (SEQ ID NO:300) 
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TABLE 1. Nucleotide and Amino Acid Scqcucnccs of E. h:-. Genes. 



?F 

vsl:al::nvld Q^iiAHYyAK 

77 K K P DK S Y F E £ H T 1 3 V 7 T I 
ISGHRGLP'A KLFTDLPELK 
VTLLTCTPYM I NGHR7L.7RG 
1 1 VvYK R F. K K 7 TRKP 



pk;nyrl?:f e?:t::alllek 
kripyqpeka aag:-:kkvac;g 



q^gagygsgf gldpfsetqk 

gyglgggtsy ftggtnthav 

::--':".-£P7ETr. dlhiesgqdl 

l:_v.tlll iacaliisgf 



EF0SC-: ( SEQ ID NO : 3 C 1 ) 



7A3TTACACT 


g^tttaggg: 


^ /-iG'—^-iA^-^T"! 






TAGCACTCTT 


7TTATTAGGA 


G 3TTATT72C 


T A TG AAAA AA. 


:ga:ttttac 


7YA77T7TTT 


CCTAATACTT 


CTTACCTTTG 


GICTTGCCCT 


ACCGGTT7CG 


GCGGCTG/iA-rt 


a. g:aattga 


TGATGGCGCA 


CAATTACTGA 


2 A Z CTGATCA 


AA. T'- - AAC C AA 




AGATACAACC 


TTTAGAAGAA 


AAAACAAAAG 


CCTCTGTCTT 


TATTGTAACC 


AGAAATAATA 


AI ACC7ATGG 


CGATGAACAA 


GAATATGCAG 


A ? JAT7ATCT 


TT i'-kAATAhA 


3TTGGCAAGG 


A" YAAAATGC 


G ATTC TTTTT 


CTCATTGATA 


TGGACTTACG 


GAAAiATC .'AC 


ATC7C7ACTT 


:tggaaacat 


GATTGATTAT 


ATGACAGATG 


CACGAATTGA 


TGATACCTTA 


GATAAAATAT 


gggataatat 


GAGTCAAGGA 


AATTATTTCG 


cgggtgctia 


AACCTTTGTT 


GAGGAAACTG 


aag:atttgt 


TAATAAAGGG 


GTTCCTGGGG 


GG 3ACTATCG 


TGTG3ACAGC 


G.-aAAC AGGTA. 




TTATAAAGTC 


ATTACCCCGC 


TGGAAATGGT 


AATTGCTTTT 


GCTGCTGCGC 


tgagagtcag 


TTTGGTCTTC 


TTAGGCATTA 


ATATTTCTAA 


ATATCAATTA 


AAATTTTCAA 


3T7AT GAATA 


TCCCTTTAGG 


GAAAAAACAA 


CTTTAAACTT 


AACCTCCCGC 


AGAGATCAGT 


TAAICAACTC 


TTTCATCACT 


ACGCGTCGTA 


TTCCTAAAAA 


CAATGGCGGC 


AGTGGCGGAA 


TGGGCGGTG3 


TGGTAGCACC 


ACCCACTCAA 


C PGGCGGCGG 


CACATTCGGT 


GGCGGCGGTC 


iAAAGTTTTTA 


G 



EF080-2 (SEQ ID NO:302) 

MKKR LLPIFFLILL TFGLALPVSA AENSIDDGAQ 
LLTPDQTNQL KQEIQPLEEK TKASVFIVTT NNNTYGDEQE 
I DMDLRKIYI STSGNMIDYM TDARIDDTLD KIWDNMSQGN 
PGGHYRVDSE TGKITRYKVI 7PLEMVIAFA AALILSLVFL 
KTTLNLTSRT DQLTNSFITT RRI PKNNGGS GGMGGGGSTT 

EF0S0-3 { SEQ ID NO:303) 

GGCTGAAA ATTCAATTGA TGATGGCGCA 

CAATTACTGA CACCTGATCA AATCAACCAA CTAAAGCAAG A G AT AC AAC C TTTAGAAGAA 

AAAACAAAAG CCTCTGTCTT TATTGTAACC ACAAATAATA ATACCTATGG CGATGAACAA 

GAATATGCAG ATCATTATCT TTTAAATAAA GTTGGCAAGG VJ7AAAATGC GATTCTTTTT 

CTCATTGATA TGGACTTACG GAAAATCTAC ATCTCTACTT CT3GAAACAT GATTGATTAT 

ATGACAGATG CACGAATTGA TGATACCTTA GATAAAATAT GGGATAATAT GAGTCAAGGA 

AATTATTTCG CGGCTGCTCA AACCTTTGTT CAGGAAACTG A„A 3CATTTGT TAATAAAGGG 

GTTCCTGGGG GGCACTATCG TGTGGACAGC GAAACAGGTA AAATCACTCG TTATAAAGTC 

ATTACCCCGC TGGAAATGGT AATTGCTTTT GCTGCTGCGC 7GA7ACTCAG TTTGGTCTTC 

TTAGGCATTA ATATTTCTAA ATATCAATTA AAATTTTCAA GT7ATCAATA TCCCTTTAGG 

GAAAAAACAA CTTTAAACTT AACCTCCCGC ACAGATCAGT 7AACCAACTC TTTCATCACT 

ACGCGTCGTA TTCCTAAAAA CAATGGCGGC AGTGGCGGAA 7GGGCGGTGG TGGTAGCACC 

ACCCACTCAA CTGGCGGCGG CACATTCGGT GGCGGCGGTC G AAG7 

EF080-4 (SEQ ID NO:304) 

AENSIDDGAQ 

LLTPDQINQL KQEIQPLEEK TKASVFIVTT MNNTYGDEQE YAYHYLLN^V GKDQNAILFL 



YAGC'i YLLNKV GKDQNAILFL 

/ F A AAQTFVQ ETQAFVNKGV 

:-::::skyqlk fssyqypfre 

:: . 7 YGGTFGG GGRSF 
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TABLE 1. Nucleotide and Amino Acid Seqeucnces of E. faecal L\ Genes. 

IDMDLRKIYI STSGNMIDYM TDARIDDTLD KIWDNMSQGN YFAAAQTFV2 ETQAFVNKGV 
PGGHYRVDSE TGKITRYKVI TPLEMVIAFA AALILSLVFL GINISP YCI-K FSSYQYPFRE 
KTTLNLTSRT DQLTNSFITT RRIPKNNGGS GGMG3GGSTT HSTGGGTFG 3 G5RS 

EF081-1 (SEQ ID NO:305) 

TGAATGGAAC GAAGCAATCG TAATAAAAAA TCTTCAAAAA AACCACTTAT TCTTGGTGTT 
TCTGCCTTGG TTCTAATCGC TGCTGCCGGT GGCGGGTATT ATGCTTATA j TCAATGGCAA 
GCCAAACAAG AATTAGCCGA AGCGAAGAAA ACAGCTACTA CATTTTTAAA CGTATTGTCA 
AAACAGGAAT TTGATAAGTT ACCGTCCGTT GTTCAAGAAG CTAGCTTAAA GAAAAATGGC 
TATGATACTA AATCTGTTGT TGAAAAATAC CAAGCAATTT ATTCAGGGAT TCAAGCAGAA 
GGAGTCAAAG CTAGTGATGT TCAAGTCAAA AAGGCGAAAG ACAATiTAATA CACATTTACC 
TATAAATTAT CGATGAGCAC GCCTTTAGGC GAAATGAAAG ATTTGTCTTA TCAATCAAGT 
ATCGCCAAAA AAGGCGATAC CTACCAAATC GCTTGGAAGC CATCTTTAAT TTTTCCAGAT 
ATGTCAGGAA ATGATAAAAT TTCGATTCAA GTAGATAATG CCAAACGTG ^ AGAAATTGTC 
GATCGTAATG GTAGTGGGCT AGCAATTAAC AAAGTGTTTG ACGAAGTGGG CGTAGTGCCT 
GGCAAACTCG GTTCTGGCGC AGAAAAAACA GCCAATATCA AAGCTTTTAG TG A TAAATTC 
GGCGTTTCTG TTGATGAAAT CAATCAAAAG TTAAGCCAAG GATGGGTOCA AGCAGACTCC 
TTTGTACCAA TCACAGTCGC TTCTGAACCA GTGACAGAAT TACCAACAGG GGCTGCGACA 
AAAGATACAG AGTCACGTTA TTATCCGCTG GGGGAAGCAN TGCGCAATTA A 

EF081-2 ( SEQ ID NO: 306) 

MERSNRNKKS SKKPLILGVS ALVLIAAAGG GYYAYSQWQA KQELAEAKKT ATTFLNVLSK 
QEFDKLPSW QEASLKKNGY DTKSWEKYQ AIYSGIQAEG VKASDVQVKK AKDNQYTFTY 
KLSMSTPLGE MKDLSYQSSI AKKGDTYQIA WKPSLIFPDM SGNDKISTQV DNAKRGEIVD 
RNGSGLAINK VFDEVGWPG KLGSGAEKTA NIKAFSDKFG VSVDEIKQKL SQGWVQADSF 
VPITVASEPV TELPTGAATK DTESRYYPLG EAXRH 

EF081-3 (SEQ ID NO:307) 

T GGCGGGTATT A TG C TT A TAG TCAATGGCAA 

GCCAAACAAG AATTAGCCGA AGCGAAGAAA ACAGCTACTA C ATT T T T A AA CGTATTGTCA 

AAACAGGAAT TTGATAAGTT ACCGTCCGTT GTTCAAGAAG CTAGCTTAAA GAAAAATGGC 

TATGATACTA AATCTGTTGT TGAAAAATAC CAAGCAATTT ATTCAGGGAT TCAAGCAGAA 

GGAGTCAAAG CTAGTGATGT TCAAGTCAAA AAGGCGAAAG ACAATCAATA CACATTTACC 

TATAAATTAT CGATGAGCAC GCCTTTAGGC GAAATGAAAG ATTTGTCTTA TCAATCAAGT 

ATCGCCAAAA AAGGCGATAC CTACCAAATC GCTTGGAAGC CATCTTTAAT TTTTCCAGAT 

ATGTCAGGAA ATGATAAAAT TTCGATTCAA GTAGATAATG CCAAACGTGG AGAAATTGTC 

GATCGTAATG GTAGTGGGCT AGCAATTAAC AAAGTGTTTG ACGAAGTGGG CGTAGTGCCT 

GGCAAACTCG GTTCTGGCGC AGAAAAAACA GCCAATATCA AAGCTTTTAG TG A TAAATTC 

GGCGTTTCTG TTGATGAAAT CAATCAAAAG TTAAGCCAAG GATGGGTCCA AGCAGACTCC 

TTTGTACCAA TCACAGTCGC TTCTGAACCA GTGACAGAAT TACCAACAGG GGCTGCGACA 
AAAGATACAG AGTCACGTTA TTATCCGCTG GGGG 

EF081-4 (SEQ ID NO:308) 

G GYYAYSQWQA KQELAEAKKT ATTFLNVLSK 

QEFDKLPSW QEASLKKNGY DTKSWEKYQ AIYSGIQAEG VKASDVQVKK AKDNQYTFTY 
KLSMSTPLGE MKDLSYQSSI AKKGDTYQIA WKPSLIFPDM SGNFKISIQV DNAKRGEIVD 
RNGSGLAINK VFDEVGWPG KLGSGAEKTA NIKAFSDKFG VSVLEINQKL SQGWVQADSF 
VPITVASEPV TELPTGAATK DTESRYYPLG 
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C77TTAAATA G77CAAAAGT T3AAGCAGCT 
AT7ACG7T7G 2TCT7GATAA TAC7G7CACG 
7ZTGTGACAC CTAA7CCTGC TGATCC77A7 

ga:tatgttt iaaa7ATc za ttt73gatca 

GGGGGA 2AAC 7GGA7CAA3T GCAAAATA3T 

3TTCAAGTAA 2TGACAAACG TGGTC7AAAT 

GOGGAA77TG ZTACAAGT3A 7TCAA ZAZZC 

TTAGCA3CAA GAG GC AA77C AACACAGTTA 

ACZT7GGATC GAAG7GG7GG CGCCAG77:7 

ATGGGCACTT G3ACATTAGC TTT7GG7AGG 

AG7G77CCTG TGAGAACGAA AAAAG77G 2A 

ATTTT3GATG ATACACCACT TTAA 



3TA 



_ ^ i JO> w 

:tttc 



2ATT 
7GGA 

:ttgg 
;ggga 

iCAAT 



:tct 

4AGG 



gcctcttatg 
2aacgctgat 
::cttctcag 
gcttagtatt 
agcgatctat 
gccaaactat 
taaacagagt 
cttgaccttt 
ggtcccagta 
ttcaacagga 
cattcaatta 
gc ttacttgg 



EF082 ■ 



(SEQ ID I0O : 3 1 C ) 



MKKIVRISS I LFVATPLML LNSSKVEAAQ VASIQSaAGI TEA: 

vtpnpaophq pgtagplsid yvgnihfgsk qiqagtaiys a;; 

QVTDKRGLNL GWKLSVKQSA QFATSDSTFA 7LDNASLTFL AA 
2DPTGAATSP VATAALSTGM GTWTLAFGSG XTAA^GIQLT Vr, 
LDDTPL 



777? pvkptnpsqp 
.*q:jst gdlisvpnyv 
atqal slapltvpvt 
:kvaa kqykttltwi 



EF082-3 (SEQ ID NO : 3 1 1 ) 

AGCT CAAGTCGCTT CTATTCAATC CAACGCTGAT 

ATTACGTTTG CTCTTGATAA TAC7GTCACG CZACCTGTCA A"'""' GACGAA CCCTTCTCAG 

2CTGTGACAC CTAATCCTGC TGAT3CTCAT 7AACCTGGTA CAGCZGGAZC GCTTAGTATT 

GACTATGTTT CAAATATCCA TTTTGGATCA AAA C AAA TT C .A ^ A\ AAAAZ AGCGATCTAT 

TCGGCACAAC TGGATCAAGT GCAAAATAGT ACTG3CGATT TAA77AGC37 GCCAAACTAT 

GTTCAAGTAA CTGACAAACG TGGTZTAAAT I77G3 3TGGA AAT7A7CAGT TAAACAGAGT 

GCGCAATTTG CTACAAGTGA TTCAACAGCC GZTGTTTTGG A7AATGCATC CTTGACCTTT 

TTAGCAGCAA CACCCAATTC AACACAGT7A CTTTZTTTGG CC CC ATTAAC GGTCCCAGTA 

ACCTTGGATC CAACTGGTGC CGCCACTTCT C GTGTGGCGA G7GGCGG7 ZT TTCAACAGGA 

ATGGGCACTT GGACATTAGC TTTTGGTAGG GGAI JZGACCG CTGC7CAAGG CATTCAATTA 

ACTGTTCCTG CGACAACGAA AAAAG7TGCA i JCTAAACAAT ATAAAACAAC GC TTACTTGG 
ATTTTGGATG ATACACCACT 

EF082-4 ( SEQ ID NO:312) 

AQ VAS1QSNADI TFALDNTVTP PVNPTNPSQP 
VTPNPADPHQ PGTAGPLSID YVSNIHFGSK QIQAG7AIYS A; 7 
QVTDKRGLNL GWKLSVKQSA QFATSDSTPA VLDNASLTFL A AT 
LDPTGAATSP VATAALSTGM GTWTLAFGSG XTAAQGIQLT \"AA 
LDDTP 

EF083-1 (SEQ ID NO:313) 

TAATTTAAAA GACAAGGAGA AATAAAAATG AAAAAGAAAA TT77AGCAGG AGCGCTTGTC 

GCTCTGTTTT TTATGCCTAC AGCTATGTTT GCCGCAAAAG GAGACCAAGG TGTGGATTGG 

GCGATTTATC AAGGTGAACA AGGTCGCTTT GGCTATGCAC ATGATAAATT CGCTATTGCC 

CAGATTGGAG GCTACAATGC TAGCGGTATT 7ATGAACAAT AAA:: A7A7AA AACGCAAGTG 

GCAAGTGCTA 77GCCCAAGG TAAAGGTGCG CATACCTATA 777AGTATGA CACTTGGGGA 



g;vq:;st gdlisvpnyv 
77stqll slapltvpvt 

7TKKVAA KQYKTTLTWI 
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TABLE 1. Nucleotide and Amino Acid Seqeucnces of E. faeces lit Genes. 

AACATGGACA TTGCGAAAAC AACAATGGAT TACTTTTTGC CACCTAvTCA AACGCCTAAA 
AATTCCATCG TTGCATTAGA TTTTG AAC AT GGAGCGTTGG CTA-.^ JTTCC AGATGGATAT 
GGAGGATATG TAAGTTCAGA TGCCGAAAAA GCAGCAAATA CAGASACAAT TTTGTACGGT 
ATGCGCAGAA TCAAACAGGC TGGCTATACT CCAATGTATT ACA ,; C7ATAA GC C ATTTAC A 

Saaatcatg taaactatca acaaatcatc aaagagtttc ctaact:ttt atggattgct 

GCGTATCCTA TCGATGGTGT GTCACCATAT CCATTGTATG CTTATTFCCC AAGCATGGAT 

GGTATTGGTA tttggcaatt cacatccgct tatattgcag gtg-ttaga tggtaacgta 

GATTTAACAG GAATTACGGA TAGTGGTTAT ACAGATACCA ATAAAC-M.A AACGGATACG 
CCAGCAACAG ATGCAGGCGA AGAAATTGAA AAAAT AC CTA ATT-TCATGT TAAAGTTGGC 
GATACCGTCA AAGTGAAATT TAATGTAGAT GCTTGGGCAA CT' " :.AAGC TATTCCGCAA 
TGGGTAAAAG GAAACAGCTA CAAAGTGCAA GAAGTAACTG GAA 'J TAGAGT ATTGCTTGAA 
GGTATCTTGT CATGGATTAG CAAAGGTGAT ATTGAATTAT TGC "A .7-AC'jC AACAGTCGTC 
CCTGATAAGC AACCAGAAGC GACTCATGTG GTACAATACG GAG AAA C ATT ATCAAGTATT 
GCTTATCAAT ATGGAACAGA CTATCAAACG TTGGCGGCAT TAAA 1 -7 2ATT GGCTAATCCA 
AATCTTATTT ATCCTGGTCA AGTTTTGAAA GTCAATGGAT CGCCAATAAG TAATGTCTAC 
ACGGTTAAAT ACGGCGATAA TTTATCTAGT ATTGCAGCAA AA'-'TTv-GCAC TACTTATCAA 
GCTTTAGCTG CATTAAACGG ATT AG C AAAT CCTAACTTGA T- '■ -»G- TCAAACATTG 
AATTATTAA 

EF083-2 (SEQ ID NO:314) 

MK KKILAGALVA LFFMPTAMFA AKGDQGVDWA IYQGEQGRFG Y.-.KDKFAIAQ 
IGGYNASGIY EQYTYKTQVA SAIAQGKRAH TYIWYDTWGN ML : AF TTMDY FLPRIQTPKN 
SIVALDFEHG ALASVPDGYG GYVSSDAEKA ANTETILYGM RF : F.". AC-iTP MYYSYKPFTL 
NHVNYQQIIK EFPNSLWIAA YPIDGVSPYP LYAYFPSMDG IG7V-TSAY IAGGLDGNVD 
LTGITDSGYT DTNKPETDTP ATDAGEEIEK IPNSDVKVGD TV:-, ,WDA WATGEAIPQW 
VKGNSYKVOE VTGSRVLLEG ILSWISKGDI ELLPDATWP DF. AJHVV Q^ETLSSIA 
YQYGTDYQTL AALNGLANPN LIYPGQVLKV NGSATSNVYT VI.: —SSI AAKLGTTYQA 
LAALNGLANP NLIYPGQTLN Y 

EF083-3 (SEQ ID NO:315) 

AAAAG GAGACCAAGG TGTGGATTGG 

GCGATTTATC AAGGTGAACA AGGTCGCTTT GGCTATGCAC ATA-A AAAATT C GC TATTGC C 
CAGATTGGAG GCTACAATGC TAGCGGTATT TATGAACAAT AC A AATATAA AACGCAAGTG 
GCAAGTGCTA TTGCCCAAGG TAAACGTGCG CATACCTATA TTTOATATGA CACTTGGGGA 
AACATGGACA TTGCGAAAAC AACAATGGAT TACTTTTTGC CAC3TATTCA AACGCCTAAA 
AATTCCATCG TTGCATTAGA TTTTGAACAT GGAGCGTTGG CT A-'TGTTCC AGATGGATAT 
GGAGGATATG TAAGTTCAGA TGCCGAAAAA GCAGCAAATA CAGASACAAT TTTGTACGGT 
ATGCGCAGAA TCAAACAGGC TGGCTATACT CCAATGTATT A 'AAC L'TATAA GCCATTTACA 
CTAAATCATG TAAACTATCA ACAAATCATC AAAGAGTTTC CZ .A.A'.'TCTTT ATGG ATTGC T 
GCGTATCCTA TCGATGGTGT GTCACCATAT CCATTGTATG C'AATTTCCC AAGCATGGAT 
GGTATTGGTA TTTGGCAATT CACATCCGCT TATATTGCAG GTOCTTTAGA TGGTAAC GTA 
GATTTAACAG GAATTACGGA TAGTGGTTAT ACAGATACCA A'.'AAACCAGA AACGGATACG 
CCAGCAACAG ATGCAGGCGA AGAAATTGAA AAAAT ACCTA A'—TGATGT TAAAGTTGGC 
GATACCGTCA AAGTGAAATT TAATGTAGAT GCTTGGGCAA C7G AGGAAGC TATTCCGCAA 
TGGGTAAAAG GAAACAGCTA CAAAGTGCAA GAAGTAACTG GAM3CAGAGT ATTGCTTGAA 
GGTATCTTGT CATGGATTAG CAAAGGTGAT ATTGAATTAT T AC r.AGACGC AACAGTCGTC 
CCTGATAAGC AACCAGAAGC GACTCATGTG GTACAATACG GA,AAAACATT ATCAAGTATT 
GCTTATCAAT ATGGAACAGA CTATCAAACG TTGGCGGCAT TAAATGGATT GGCTAATCCA 
AATCTTATTT ATCCTGGTCA AGTTTTGAAA GTCAATGGAT CG3CAACAAG TAATGTCTAC 
ACGGTTAAAT ACGGCGATAA TTTATCTAGT ATTGCAGCAA AACTTGGCAC TACTTATCAA 
GCTTTAGCTG CATTAAACGG ATT AG C AAAT CCTAACTTGA TTTATCCAGG TCAAACATTG 
AAT 
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TABLE 1. Nucleotide and Ammo Acid Seqcucnces o: L. hiccah: 



EFC63-4 



NC : 3 1 k 



K.GDQGVDWA IYQGEQGRFG YAKEKFAIAQ 

:3gy:;asg:y eqvtykt^va saiaogkkah rviv.vGT 

F : VALDFEHG ALAS VP EG YG GYVS S GAErlA ANTE 7 1 L 
KHVNYQQIIK FFPNSLWIAA YPTDGVSPY? T.Y'AYFPF 
LTGITDSGYT DTMKPETDTP ATEAGEEIEK IF::SEYK 
VKGKSYKVQE VTGSRVELEG ILSWISKGCI ELLFEAT 
YQYGTDYQTL AALNGLAKPN LIYPGQVLKV 
LAALNGLAN? XLZYPG^TLN 



^ c - v r 



3C TJKVKF 
LFEA7YYP DKC?E.- 
SATSITvYT YKYGE: 



■ lpriqtpkn 
■:yysyk?ftl 

■:atgeaipqw 
:>ygetlssia 
vaxlgttyqa 



EF084-1 ( SEQ ID NO:317) 



TAGTCAAACG 

CTATTGAATT 

AGCGGTATGA 

AAAAATCGCC 

AATAATCGGC 

AGCGCATTTT 

CACTTAAAAG 

GTTGATGCGC 

GTCACAAAAT 

ATTTATGCAC 

ACTGGTGATG 

ACCGAATCAC 

TCTTTACAAG 

CAAATGGATA 

AGTTTTATTA 

GTGTTTCCGA 

CTTTCCTTTG 

GAAAAATCTT 

GAGAGCAAAT 

ACTGGAGGCT 

GCAGAAATTG 



TTTATTTTTT 

GGAGGAGAAC 

TGGCCACGAT 

GTGTTGGGAA 

CCGCAGAAGA 

CACAATTTGA 

AAGAAGATCA 

TATTTAACCG 

TACTACGAAA 

CGTGTGTTGT 

GCTATAAACT 

CTATTATTTC 

ATGTTAATTT 

TGCTGTTTAC 

ACCAAGAATT 

CAAAATCTTT 

TGGAACTACT 

TTGTCACAGG 

TAGTCAATGG 

ACAATGTTAC 

CAGAATACAC 



CCTTAAH-'^ 

TATGAAGAAA 

TGCGGCCGCC 

AAAATTATTA 

AATCATTTCA 

TAACTATGAT 

CGGACGCATG 

CATTAACGAA 

AGACGATCAA 

ATTAACAACT 

AGCCAAAAAA 

TGAAGAACCT 

AACTGTTTTG 

ACATTTTGGC 

AACTCGCAAC 

TGAAGAAGTG 

GAAAGACTTT 

CGGTGGGATT 

TTTATTTTTT 

AGCTGCATTT 

CTATTTACCA 



A G AAA-AAA T C 
TTTGATG1AA 
GAAG JAGGCG 

atga:tgctg 

TTTATTCCTG 

at-atgaact 
tt:c :t:-tta 

TTAG3A3TCA 
ATAATT3GCG 
GGCGGCCGCA 
ATGGGGCATA 
TTTATC 3TGG 
AACCAAAAAG 
ATTTCAGGAC 
GGTAATCAAC 
CCTGICAAAC 

cagtt:act3 
tccct3aaag 
gctggtgaac 
gtcactggac 
attgaagaag 



CCGTA.- T f 
TTATTGT. 

ctcaa/t 
;cggc. .:• 

GGAAT..-G 

: AG at; _A 
TTGAAAC 

CTTAT::: 

C CATC AG 

ataaa;. : 
gaaaa:;: 
ctg: 
ctgt:a: 

AACTAA;: 
TTACT-A 
AAGTT-,: 

ttttag; 

atgt; 

tcta^. 



7,7 GG TAG ACTAC 

tgctgggacg 
.77 attgattgaa 
:tg taatgtaacc 
,aa ATTTTTATAC 
,t; caatggtatt 
,7 : gaagtcaatt 
:ac aaaaa c a c ag 
;ga actggaaaaa 
:-t: cacaggagca 
:c: g:tctaccct 
:-tt g:aaggtctc 
7tt agttaatcat 
;,c7 c3gctgttct 

jCCTTGGAT 
AAAGCAACGN 
ATTGCCTTTG 
TAAAACAATG 
TAATGGCTAT 
CTCCCATGCC 



2T 



;at 



EF084-2 { SEQ ID NO:318) 

MKKF DVIIVGAGTS GMKATIAAAE AGAQVLLIEK 

NRRVGKKLLM TGGGRCNVTN NRPAEEI ISF I?Gi:GKFLYS AFSCFENYDI 
LKEEDHGRMF PVTDKSKE1V DALFNRIIIEL GVTVFTKTQV TKLLRF.L CQI 

yapcwlttg grtypstgat gdgyklakkm ghtifplypt esp::sfepf 

LQDVNLTVLN QKGKPLVNHQ KDMLFTHFGI SGPAALFCSS FINC EL-TRIJG 
FPTKSFEEVP AKQLTEKQRL SFVELLKDFQ FTVTKTLPLE KSFYTGGGIS 
SKLVNGLFFA GELLDINGYT GGYNVTAAFV T3HVAGSHAA EIAF77YLPI 



MNFFESNGIH 
IGVETELEKI 
: LDKTLQGLS 
IIQPVTVALDV 
LKEVTPKTME 
EEV 



EF084-3 (SEt ID NO:319) 

C GAAGCAGGCG CTCAAGTATT ATTGATTGAA 

AAAAATCGCC GTGTTGGGAA AAAATTATTA ATGACTGGTG GCGGCCGCTG 
AATAATCGGC CCGCAGAAGA AATCATTTCA TTTATTCCTG GGAATGGAAA 
AGCGCATTTT CACAATTTGA TAACTATGAT ATCATGAACT TTT7TGAATC 
CACTTAAAAG AAGAAGATCA CGGACGCATG TTCCCTGTTA CAGATAAATC 
GTTGATGCGC TATTTAACCG CATTAACGAA TTAGGAGTCA CTGT7TTTAC 



TAATGTAACC 
ATTTTTATAC 
GAATGGTATT 
GAAGTCAATT 
AAAAACACAG 
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TABLE 1. Nucleotide and Ammo Acid Seqeuenccs ot E. faea: -is Genes. 



GTCACAAAAT TACTACGAAA AGACGATCAA 
ATTTATGCAC CGTGTGTTGT ATTAACAACT 
ACTGGTGATG GCTATAAACT AGCCAAAAAA 
ACCGAATCAC CTATTATTTC TGAAGAACCT 
TCTTTACAAG ATGTTAATTT AACTGTTTTG 
C AAA TGG ATA TGCTGTTTAC ACATTTTGGC 
AGTTTTATTA ACCAAGAATT AACTCGCAAC 
GTGTTTCCGA CAAAATCTTT TGAAGAAGTG 
CTTTCCTTTG TGGAACTACT GAAAGACTTT 
G AAAAATC TT TTGTCACAGG CGGTGGGATT 
GAGAGCAAAT TAGTCAATGG TTTATTTTTT 
ACTGGAGGCT ACAATGTTAC AGCTGCATTT 
GCAGAAATTG CAGAATACAC CTATTTACCA 



ATAATTGGCG TTGAAA :CGA ACTGGAAAAA 

ggcggccgca ctt; : : :ttc CACAGGAGCA 

ATGGGGCATA CCATCAGCCC GCTCTACCCT 

tttat:ctgg ataaaacgtt gcaaggtctc 

AACCAAAAAG GAAAACTTTT AGTTAATCAT 

atttcaggac ctggzgcgct ccgctgttct 

GGTAATCAAC CTG': 'JA'JGGT AGCCTTGGAT 

cctg:c aaac aac:aa:aga aaagcaacgn 

CAGTTCACTG TTAA G A AAAC ATTGCCTTTG 

tccctcaaag aagtga-ccc TAAAACAATG 

GCTGGTGAAC TTTTAGATAT TAATGGCTAT 
gtca:tggac ATGTi 3 2TGG CTCCCATGCC 
ATTGAAGAAG TC 



EF084-4 (SEQ ID NO:320) 



E AGAQVLLIEK 

NRRVGKKLLM TGGGRCNVTN NRPAEEI ISF 
LKEEDHGRMF PVTDKSKSIV DALFNRINEL 
YAPCWLTTG GRTYPSTGAT GDGYKLAKKM 
LQDVNLTVLN QKGKPLVNHQ MDMLFTKFGI 
FPTKSFEEVP AKQLTEKQRL SFVELLKDFQ 
SKLVNGLFFA GELLDINGYT GGYNVTAAFV 



IPGNGKFLYS AFSy-FT-NYDI MNFFESNGIH 
GVTVFTKTQV TKLLF- KDDQI IGVETELEKI 
GHTISPLYFT ESP7 2SEEPF ILDKTLQGLS 
SGF AALRCSS FIT', KLTRNG NQPVTVALDV 
FTV7KTLPLE K5" F '."T GGGI S LKEVTPKTME 
TGHVAGSHAA EIAAA.TYLPI EEV 



EF085-1 (SEQ ID NO:321) 

TAACCCATGA AATCATTTTG TCCCGCATAT 
CAGTCCACAC TCATATCAAA AATCTGCGGG 
CCGCGGTGTA GGTTACCGAT TGGAGGAATC 
AAGGTTTTTT CCTATACGAT CATTGTCCTG 
TTTGCACAGC AATTTGTGTC TTATTTCAGA 
TATCAGCCAT TGGTGGAACT GATTCAGAAT 
GGGCTGTTTC ACTACAATAA CCAATCCTTT 
GTACTCTATG CCACACCGAA TGCCGATACA 
GTGGTACATA GAGATGATAA TATTTCGATT 
CTTTATCAAG GGCTGACAAT TCGGGGAATT 
CTTTTATGCG CGTATATCTT TGCGCGGCAA 
AGTGCGAATA AAATGGCAAA CCTGAAAGAA 
CTTGGCGCAC TGGCTCACGA CATGCATTCC 
AGGCTGGAGG ATGAAATCGC AAGGGAACAT 
GCGGCAGCCT CTCATGAGTT AAAAACGCCC 
ATGCTTGAAA ATATCGGTGA CTACAAAGAC 
ATGATGGACA GGCAGGGCAA AACCATTTCC 
GGGAGAATCG TACCCATAGC CGAACCGCTG 
CCCGATTTTC AAACCTTGGC AGAGGCAAAC 
GGACAAATTG TCCTGTCCGA TCCGAAGCTG 
AATGCGGTTC AGAACACGCC CCAGGGAGGT 
GAAAAATACC GTCTTTCCGT TTTGAACATG 
AAGCTGTTCA TCCCATTCTA TCGCATTGAT 
CGGTTTGGGG CTTGCCATCG TACAAAAAAC 
GGAAAACACC TCAGATGGCG TTTTGTTCTG 
AATATTTAA 



GGGGATATGA CTTT-: 4ACGGT GATGGCAGCA 
CGAACTGCCG GAA.AATATCA TCAAAACCAT 
ATTATAATGG AA AG. AA A AG 3 GATTTTCATT 
TTACTGCTTG TCGGTCTAAC GGCAACACTG 
GCGATGGAAG CACAGCAAAC AG TAAAATC C 
AGCGATAGGC TTGATATGCA AGAGGTGGCA 
GAGTTTTATA TTGAAGATAA AGAGGGAAGC 
TCAAATAGTG TTAG "5GCCGA CTTTCTTTAT 
GTTGCTCAAA GCAA^GCAGG TGTGGGATTG 
GTTATGATTG CGA' AAATGGT TGTATTCAGC 
ATGACAACGC CGAT^AAAGC CTTAGCGGAC 
GTACCGCCGC CGCTGGAGCG AAAGGATGAG 
ATGTATATCA GGCT3AAAGA AACCATCGCA 
GAGTTGGAGG AAACA-AGCG ATATTTCTTT 
ATCGCGGCTG TAAGCGTTCT GTTGGAGGGA 
CATTCTAAGT ATCTG:GCGA ATGCATCAAA 
GAAATACTGG AGCTT^TCAG CCTGAACGAT 
GACATAGGGC GCA.r 3 3TTGC CGAGCTGCTA 
AACCAGCGGT TAG7CACAGA TATTCCAGCC 
ATCCAAAAGG CAA7ATCCAA TGTCATATTG 
GAGGTACGGA TA r : AGAGTGA GCCTGGGGCT 
GGCGTTCACA TTAAT3ATAC TGCACTTTCA 
CAGGCGCGAA GO' AGC AAAAA GTGGGCGAAG 
GCTGGATGCC ATGAGCCTCC AATATGCGCT 
GCTGGATTTA CCGCCCACAT CAACACTATA 
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TABLE 1. Nucleotide and Ammo Acid Scqeucnces o; E fjecuis 



EF085-G (SEQ ID 



MERKGIFIK 



7IZVLL LLYG7TATLF 
■ YIEDKEGSY 



• v " — pv » r"Tc 

LC AY I FAR 1 ,, 
I ARE HE 
:vtt CTT 



LFHVh";QSFE 
YQGLTIRGIV M I A I MW F 
GALAKCMHSK YIRLKETIAF LEGE 
LENIGEYKDK SKYLRECIKK XERQ 
DFQTLAEANN £RFVTEIPAG CIVLSEPKI 
KYRLSYLNKG YHICTTALSE LFIPFYRII 
KHLRWRFVLA GFTAHINTIK : 



LEG- 



t S£K>". 



r" A" G FEI 

■■3 EG EI VP 1^ 

:e:; avgmt; 

\Et : . FGAChr 



Er.EEKQEVAG 

a;e:gagvgll 
pt: gerkgel 
aaysyllegm 

IGETVAELLP 

yr:v:sepgae 
ggeeppicag 



EF085-3 ( SEC 



NC 



GC AATTTGTG 
TATCAGCCAT 
GGGCTGTTTC 
GTACTCTATG 
GTGGTACATA 
CTTTATCAAG 
CTTTTATGCG 
AGTG CGAATA 
CTTGGCGCAC 
AGGCTGGAGG 
GCGGCAGCCT 
ATGCTTGAAA 
ATGATGGACA 
GGGAGAATCG 
CCCGATTTTC 
GGACAAATTG 
AATGCGGTTC 
GAAAAATACC 
AAGCTGTTCA 
CGGTTTGGGG 
GGAAAACACC 
AATATTT 



TC TTATTTCAGA C;CGATGGAAG CACAGCAAAC a^- 
TGGTGGAAGT GATTCAGAAT AGCGATAGGC TTGAI .- 
ACTACAATAA CCAATCCTTT GAGTTTTAT A TTGAAC 
CCACACCGAA TGCCGATACA TC AAATA I-TG 7TAGG: 
GAGATGATAA TATTTCGATT GTTGCTC AAA GC AA IG 



GGCTGACAAT TGGGGGAATT GGTA. GA GEE CG^T,-^ 
CGTATATCTT TGCGCGGIAA ATGACAAG' ; C „l.A._/ 
AAATGGCAAA CCTGAAAGAA GTACCGCCGC CECT-G 
TGGCTCACGA CATGCATTCC ATGTAGATGA GEGTG. 
ATGAAATCGC AAGGGAACAT GAG1TGGAGG AAACA 
CTCATGAGTT AAAAACGCCG AGCGCGGCGG TAAGG 
ATATCGGTGA CTACAAAGAC CATTCTAAGT ATCGG 
GGCAGGGCAA AACCATTTCC GAAATACTGG AGCGG 
TACCCATAGC CGAACCGCTG GACAGAGGGC GCACG 
AAACGTTGGC AGAGGCAAAC AACGAGGGGT TCGTC 
TCCTGTCCGA TCCGAAGGTG ATCOAAAAGG CGCTA 
AGAACACGCC CCAGGGAGGT GAGGTAGGGA TATGG 
GTCTTTCCGT TTTGAACATG GGCGTTCACA TTGA1 
TCCGATTCTA TCGCATTGAT CAGGCG ":GAA GCAG: 
CTTGCCATGG TACAAAAAAC GCTGGATG CC ATGA: 
TCAGATGGCG TTTTGTT CTG GCTGGATTTA CCGG: 



CGCA AGAGGTGGCA 
AGAA AGAGGGAAGC 

;CAGG TEGGGGATTG 

, TGTATTCAGC 

-.AAI.G CTTAGCGGAC 

A-.GIG AAA GGATGAG 

:-AA I-A AAC CATCGCA 

: AG EG ATATTTCTTT 

; ; z GTGGGAGGGA 

:e;ea agggatcaaa 
etc ag ccggaacgat 
gttgc ggagctgcta 
ag' ag a tattccagcc 
gegaa tgtcatattg 
agcga gectggggct 
g ag ac tgcactttca 

kAJJ-Jt G TGGG C G AAG 
iecggc AATATGCGCT 

eeaoat eaagactata 



EF085-4 (SEQ' ID NO:324) 

QFVSYFRA MEAQC'TVKSY QPLVELIQI4S DRLDMCEVAG 
LFHYNNQSFE FY1 EDKEGSV LYATPNADTS KSVRPEFEYY 
YQGLTIRGIV MIAIMWFSL LCAYIFARQM TTPIKALAES 
GALAHDMHSM YIRLKETIAR LEDEIAREHE LEEGQFYFFA 
LENIGDYKDH SKYLRECIKM MDRQGKTISE ILELYSLG.G 
DFQTLAEANN QRFVTDIPAG QIVLSDPKLI QKALSKV ILK 
KYRLSVLNMG VHIDDTALSK LFIPFYRIDQ ARSSKEWAKF. 
KHLRWRFVLA GFTAHINTIN I 



vkfg: 

AMH-'G 
AASrG 
FTT 
AVCFE 

EG AC' 



EF086-1 (SEC ID NO:325) 

TAACTGGTGG GATTGGCAAA TTGGTTCCGC GCAGGGCTAA CAEA^ 
CATGATGACC TATTGAATAC AGATGCAGAA AAATTAAATA AATTI 
CTGTATGCAA AAGATCCAAA CATACAATGG CCAATTTATC GTGC- 
AC AG AT ATTT CAATCACCGT TTTAGGTACT GGA'CTTTTET TAGA, 



:e:v ag;gkagvgll 

_K EV PPPLERKDEL 
'7? I AAVSVLLEGM 
IV LI TGRTVAELLP 
;>GGE VFIWSEPGAE 
EKNA GCHEPPICAG 



,CATT GATTTTATTA 
.EvGG TCCGCTGATG 
- EAGG AGCTAACTTA 
l-AGAA GCAACGCCTA 
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TABLE 1. Nucleotide and Amino Acid Seqcncnccs of E. faecxiis Genes. 

GTACAAGTAC AAGAAGCTGT TCCGTCCGTT TTAAAAAGTG TTT CCTCTGG TGATGGCTTA 
TATCCTGATG GTTCCTTGAT TCAACATGGT TATTTTCCGT ACAACGGCAG TTACGGGAAT 
GAGTTGCTAA AAGGGTTTGG ACGAATTCAG ACTATTTTAC AAGGTTCCGA CTGGGAGATG 
AATGACCCTA ACATTAGTAA TTTATTTAAT GTTGTGGATA AAGGTTACTT A C AATTG ATG 
GTAAATGGAA AAATGCCATC GATGGTTTCT GGTAGAAGTA TTTCCAGAG Z GCCAGAAACG 
AATCCTTTTA CTACAGAGTT TGAATCGGGT AAAGAAACAA TAGCTAATTT AACCTTAATT 
G C AAAATTTG CACCAGAAAA TTT AA G AAA T GACATTTATA CATCTATCCA AACGTGGCTT 
CAACAAAGTG GGTCATACTA TCATTTCTTT AAAAAACCAA GAGA7TTTGA AGCGTTAATT 
GACTTGAAAA ATGTAGTGAA TAGTGCGTCA CiTTGCCCAAG CGACAC .:AAT GCAATCTTTA 
AATGTATATG GTTCGATGGA TCGAGTCCTA CAGAAAAATA ACGAATATGC GGTGGGGATC 
AGTATGTATT CACAACGTGT CGGAAACTAT GAATTTGGGA ATA 7GGAAAA TAAAAAAGGC 
TGGCATACAG CAGACGGCAT GCTTTATTTA TA 7AATCAAG ACTTTGCTCA GTTTGATGAA 
GGATACTGGG CAACGATCGA TCCATATCGA TTACCAGGAA CGA7AGTTGA CACAAGAGAA 
TTGGCAAATG GTGCTTATAC AGGGAAACGC AGTCCCCAGT CATGGG TAGG TGGCTCAAAT 
AATGGACAGG TTGCCTCTAT AGGAATGTTT TTACATAAAA GTAYrGAAGG AATGAACTTA 
GTTGCTAAAA AATCTTGGTT CTTATTAGAT GGTGAAATCA TTAATTTGGG AAGTGGCATT 
ACTGGTACGA CAGATGCTTC GATTGAAACA ATC 7TCGATA AT "J jGATGAT TCATCCACAG 
GAAGTGAAGC TTAACCAAGG TTCAGACAAA GATAATTCTT GGATTA^TTT AAGCGCAGCG 
ANTCCATTGA ATAACATTGG CTATGTTTTT CCTAATTCNA TAAYTACGCT TGATGTTCAA 
ATAGAAGAAC GCTCTGGTCG CTACGGAGAT ATTAACGAAT ACTTTGTTAA TGATAAAACC 
TATACAAATA CATTTGCTAA AATTAGTAAA AATTATGGCA AGA:TGTTGA AAATGGTACT 
TACGAATATT TAACAGTGGT TGGGAAAACG AATGAAGAAA TCG -AG 7TCT TTCTAAAAAC 
AAAG G C TATA CTGTTCTAGA AAATACAGCA AACTTACAAG CCATTGAAGC AGGTAATTAT 
GTCATGATGA ATACATGGAA TAATGACCAA GAAATTGCAG GA 7 r jTATGC GTATGATCCA 
ATGTCGGTTA TTTCAGAAAA AATTGATAAC GGTGTTTATC GCYTAACTCT TGCGAATCCT 
TTACAAAATA ATGCATCCGT TTCTATTGAA TTTGATAAGG G 7 A CTCTTGA AGTAGTCGCA 
GCGGACCCAG AAATTTCTGT TGACCAAAAT ATTATCACTT TAAATA^TGC GGGGTTAAAT 
GGCAGCTCGC GTTCAATCAT TGTTAAAACA AC7:CT3AAG TAA7 3AAAGA AGCGTTAGAA 
AAATTAATTC AGGAACAAAA AGAACACCAA GAA AAAG ACT A 7AC 7G TAAG CAGCTGGAAA 
GTCTACAGCG AAGCATTGAA ACAAGCACAA ACT jTGGCAG ATC A AACAAC AGCAACGCAA 
GCAGAAGTAG ACCAAGCAGA AACAGAGTTA CGTTCGGCAG TGAAGCAATT GGTAAAAGTG 
CCAACTAAAG AAGTAGATAA AACCAACTTG TTG AAA ATC A T 3 A A A 7 AAAA CGAGAAACAC 
CAAGAAAAAG ACTACACCGC AAGCAGTTGG AAAGTZTACA G1CAA7 7ATT GAAGCAAGCG 
CAAACTGTGG CAGATCAAAC AACAGCAACG CAAGCAGAA 3 TAGA::AAGC AGAAGCAAAA 
CTACGTTCGG CAGTGAAGCG ATTAACATTG AAAAATAGTG GC-G AAAATAA AAAG GAG C AA 
AAAAATGGGG GGAATAATGG ACACTTAAAT AC TAG TAG AG GAGTTCATCA AACTGGTACG 
AAACAAGTTA AGCCATCAAG CCAAGGTGGT TTCAGAAAAG CTAGCl'AATT TTTACCGAGC 
ACAGGAGAAA AGAAATCGAT CGCGCTTGTG ATTATTGGTC TTCTAGTTAT CGCCAGTGGG 
TGTCTTTTAG TTTTTCGTAA AAGTAAATCG AAGAAGTAA 

EF086-2 (SEQ ID NO:326) 

LVGLANWFRA ALTDTLILLH DDLLNTDAEK Lt'JYFC APLML YA! -7 ?1 ' IC.-WP I YRATGANLT 
DISITVLGTG LLLEDNQRLV QVQEAVPSVL FSVSSGDGLY I I/GCC 2 QHGY FPYNGSYGNE 
LLKGFGRIQT ILQGSDWEMN DPNISNLFNV VDKGYIjQLKV NC-r KPfMVSG RSISRAPETN 
PFTTEFESGK ETIANLTLIA KFAPEULRND TYTSIQTWLQ C-^'SYYHFFK KPRDFEALID 
LKNWNSASP AQATPMQSLN VYGSMDRVLQ FIIKEYAVGIS Ml 7 £R VGNYE FGNTENKKGW 
HTADGMLYLY NQDFAQFDEG YWATIDPYRL PGTTYDTFEL AI1C-AYTGKRS PQSWVGGSNN 
GQVAS IGMFL DKSNEGMNLV AKKSWFLLDG QIim.GSGIT GTTDAFIETI LDNRMIHPQE 
VKLNQGSDKD NSWISLSAAX PLNNIGYVFP USHCTLDVCI EERSGRYGDI NEYFVNDKTY 
TNTFAKISKN YGKTVENGTY EYLTWGKTN EEI AALSKIIK GYTYL.ENTAN LQAIEAGNYV 
MMNTWNNDQE IAGLYAYDPM SVISEKIDNG ^r/F LTLAT.'PL t.'I'-^AS V'SIEF DKGILEWAA 
DPEISVDQNI ITLNSAGLNG SSRSIIVKTT PEVTKEALEK LIC'EC'FEHQE KDYTASSWKV 
YSEALKQAQT VADQTTATQA EVDQAETELR SAVX^LVFYP TKFYDKTIJLL KIIKENEKHQ 



WO 98/50554 



PCT AJS98/08959 



TABLE 1. Nucleotide and Ammo Acid Scqeue^c^ c- 



jenes. 



FKGYTAS SWK VA 

nggn'Nghlnt sz 

LLVFRKSKSK K 
EF086-3 (SEQ 



GCTGT?: C\*KFSiCGGr 



tSGENKKEQK 

t gllv:asgc 



ID N3:3 2~^ 



ACCAGAAAA TTTAAGAAAT GAC ATTTATAj--^ ^ - 
CAACAAAGTG GGTCATACTA TG ^ TG ^; ^77 
GACTTGAAAA ATGTAGTGAA TAGTGG ^ T ^ 77 77~ 
T j. mpTATATG GTTCGATGGA TCGAGT7C7A 
- G ^r r -,m A m- CACAACGTGT CGGAAA-TA- - 
TGGCATACAG C AGACGGC AT GCTTTATTTA TATA 
GGATACTGGG CAACGATCGA TCCAT^T^ . ^ 
TTGGCAAATG GTGCTTATAC AGGGAAAC^- 
AAT 

EF08C-4 (SEQ 12 NO:326^ 

P^NLRND IYTSIQTW^Q QSGSYYKFFK KP^F^ 
LKNVVT-JSASP AQATPMQSLN VYGSMGRY7C ^ 
HTADGMLYLY NQDFAQFDEG YVJATIDPYR- ?- 



A. 7 3 A_A G 



.7TTTGA AGCGTTAATT 
iacgaat GCAATCTTTA 
■ A'T 1 AT 3 C GGTGGGGATC 
;7AAAAA TAAAAAAGGC 
;t::;ctca GTTTGATGAA 
■:a:ttga cacaagagaa 
-g: : tagg tggctcaaat 



ttvdtf 



-g:iye fgntenkkgw 
7gkrs pcswvggsnn 



EF087-1 ( SEQ ID NO : 3 2 9 ) 



taactggtgg 

CATGATGACC 
CTGTATGCAA 
ACAGATATTT 
GTACAAGTAC 
TATCCTGATG 
GAGTTGCTAA 
AATGACCCTA 
GTAAATGGAA 
AATCCTTTTA 
GCAAAATTTG 
CAACAAAGTG 
GACTTGAAAA 
AATGTATATG 
AGTATGTATT 
TGGCATACAG 
GGATACTGGG 
TTGGCAAATG 
AATGGACAGG 
GTTGCTAAAA 
ACTGGTACGA 
GAAGTGAAGC 
ANTCCATTGA 
ATAGAAGAAC 
TATACAAATA 
TACGAATATT 
AAAGGCTATA 
GTCATGATGA 
ATGTCGGTTA 
TTACAAAATA 



GATTGGCAAA 
TATTGAATAC 
AAGATCCAAA 
CAATCACCGT 
AAGAAGCTGT 
GTTCCTTGAT 
AAGGGTTTGG 
ACATTAGTAA 
AAATGCCATC 
CTACAGAGTT 
CACCAGAAAA 
GGTCATACTA 
ATGTAGTGAA 
GTTCGATGGA 
CACAACGTGT 
CAGACGGCAT 
CAACGATCGA 
GTGCTTATAC 
TTGCCTCTAT 
AATCTTGGTT 
CAGATGCTTC 
TTAACCAAGG 
ATAACATTGG 
GCTCTGGTCG 
CATTTGCTAA 
TAACAGTGGT 
CTGTTCTAGA 
ATACATGGAA 
TTTCAGAAAA 
ATGCATCCGT 



TTGGTTCCGC G C A ■■ 
AGATGCAGAA. AAA' 
CATACAATGG AAA 
TTTAGGTACT GGA 
TCCGTCCGTT TTA 
TCAACATGGT TAT 
ACGAATTCAG A CI 
TTTATTTAAT GT7 
GATGGTTTCT GGZ 
TGAATCGGGT AAJ 
TTTAAGAAAT GA 
TCATTTCTTT AA ; 
TAGTGCGTCA CC 
TCGAGTCCTA CAJ 
CGGAAACTAT GA 
GCTTTATTTA TA 
TCCATATCGA TT 
AGGGAAACGC AC 
AGGAATGTT? TT 
CTTATTAGAT GC 
GATTGAAACA A r 
TTCAGACAAA C-J 
CTATGTTTTT C< 

ctacggagat a' 
aattagtaaa a 

TGGGAAAACG A 
AAATACAGCA A 

taatgaccaa a 
aattgataac : 

TTCTATTGAA Z 



;.CATi 



5ATAA 



tttccgt 
aattttac 

.AAGAAGTA- 
-.GAAACAA 
CATTTATA 
AAA-ACCAA 
7GCCCAAG 
GAAAAATA 

CAATCAAG 
'ACCAGGAA 

"AGATAAAA. 

TCCTCGATA 
V7AATTCTT 
CTAATTCNA 
TTAACGAA - 



-.CTTACAAG 
i. A^^GC AG 
TTGTTTATC 
TT GAT> iAiGv.- 



AAAAAGGCAG 
AAG ATTCCGA 
AAG ATTACTT 
T7C3CAGAGC 
TAGCTAATTT 
CAT3TATCCA 
GAGATTTTGA 

cgacaccaat 
acgaatatgc 

ATAACGGAAAA 
;,;;TTTGCTCA 
GGAGAGTTGA 
CA rG 3 5TAGG 
7 7AATGAAGG 
7TAATTTGGG 
ATCGGATGAT 
' G7ATTAGTTT 
, 7C AATACGCT 
;.7TTTGTTAA 
AAACTGTTGA 
TC GCAGCTCT 
CCATTGAAGC 
GACTGTATGC 

G7ATTGTTGA 



GATTTTATTA 
TCCGCTGATG 
AGCTAACTTA 
TCAACGCCTA 
TGATGGCTTA 
TTACGGGAAT 
CTGGGAGATG 
ACAATTGATG 
GCCAGAAACG 
AACCTTAATT 
AACGTGGCTT 
AGCGTTAATT 
GCAATCTTTA 
GGTGGGGATC 
TAAAAAAGGC 
GTTTGATGAA 
CACAAGAGAA 
TGGCTCAAAT 
AATGAACTTA 
AAGTGGCATT 
TCATCCACAG 
AAGCGCAGCG 
TGATGTTCAA 
TGATAAAACC 
AAATGGTACT 
TTCTAAAAAC 
AGGTAATTAT 
GTATGATCCA 
TGCGAATCCT 
AGTAGTCGCA 
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TABLE 1. Nucleotide and Amino Acid Scqeucnces 



oi'F. facca lis Genes. 



GCGGACCCAG 

GGCAGCTCGC 

AAATTAATTC 

GTCTACAGCG 

GCAGAAGTAG 

CCAACTAAAG 

CAAGAAAAAG 

CAAACTGTGG 

CTACGTTCGG 

AAAAATGGGG 

AAACAAGTTA 

ACAGGAGAAA 

TGTCTTTTAG 



AAATTTCTGT 
GTTCAATCAT 
AGGAACAAAA 
AAGCATTGAA 
ACCAAGCAGA 
AAGTAGATAA 
ACTACACCGC 
CAGATCAAAC 
CAGTGAAGCG 
GGAATAATGG 
AGCCATCAAG 
AGAAATCGAT 
TTTTTCGTAA 



TGACCAAAAT 

TGTTAAAACA 

AGAACACCAA 

ACAAGCACAA 

AACAGAGTTA 

AACCAACTTG 

AAGCAGTTGG 

AACAGCAACG 

ATTAACATTG 

ACACTTAAAT 

CCAAGGTGGT 

CGCGCTTGTG 

AAGTAAATCG 



ATT AT C AC T I' TAA AT AGTC : 
ACTCCTGAAG TAACGAAAV ■ 



GAAAAAGACT 
ACTGTGGCAG 

cgttcggcag 
ttgaa-aat:a 
aaagtctaca 
cmgcag/v.g 
aaaaatagtg 
actagtacag 
ttcagaaaag 
attattg gtc 
aagaagtaa 



AC AGO 1 -CAA 

atcaaaca-a 
tgaagcaat 
tc aaa 1 . j aaa 
gtgaagcat 
tagaccaag 
g 3 g aaa at a 
gagttgat;:. 
ctag.: :aa7 

TTCTA 2T r 



GGGGTTAAAT 
AGCGTTAGAA 
CAGCTGGAAA 
AGCAACGCAA 
GGTAAAAGTG 
CGAGAAACAC 
GAAGCAAGCG 
AGAAGCAAAA 
AAAGGAGCAA 
AACTGGTACG 
1 TTTACCGAGC 
1 CGCCAGTGGG 



EF087-2 (SEQ ID NO:330) 



LVGLANWFRA 
DISITVLGTG 
LLKGFGRIQT 
PFTTEFESGK 
LKNWNSASP 
HTADGMLYLY 
GQVASIGMFL 
VKLNQGSDKD 
TNTFAKISKN 
MMNTWNNDQE 
DPEISVDQNI 
YSEALKQAQT 
EKDYTASSWK 
NGGNNGHLNT 
LLVFRKSKSK 



ALTDTLILLH 
LLLEDNQRLV 
ILQGSDWEMN 
ETIANLTLIA 
AQATPMQSLN 
NQDFAQFDEG 
DKSNEGMNLV 
NSWISLSAAX 
YGKTVENGTY 
IAGLYAYDPM 
ITLNSAGLNG 
VADQTTATQA 
VYSEALKQAQ 
STGVDQTGTK 



DDLLNTDAEK 

QVQEAVPSVL 

DPNISNLFNV 

KFAPENLRND 

VYGSMDRVLQ 

YWATIDPYRL 

AKKSWFLLDG 

PLNNIGYVFP 

EYLTWGKTN 

SVIEEKIDNG 

SSRSIIVKTT 

EVDQAETELR 

TVADQTTATQ 

QVKPSSQGGF 



lnkftapi: 
ksvssgd:;; 
vdkgylql: 
i ytsiqtw! 
knheyavg 

PGTTVDIr 
QIIIILGSG 
NSKNTLLV 
EE I AALSI- 
VYRLTLAi: 
PEVTKEAL 
SAVKQLVl 
AEVDQAE* 
RKASQFL; 



vakc: 
ngki 



7 T TT ( 



IT 

; i 

FK 

PL 

,EK 

VP 



ANGA' 
GTTD. 

EERS' 
GYTV 
CNWA 

liqi: 

TK EV 
F.STAV 
GEKr 



;yhffk 
-vgvy k 
ytgkrs 



C'K T 
KF L 
SI A 



I YRATGANLT 

FPYNGSYGNE 

RSISRAPETN 

KPRDFEALID 

FGNTENKKGW 

PQSWVGGSNN 

LDNRMIHPQE 

KEYFVNDKTY 

LCAIEAGNYV 

DKGILEWAA 

K.L" YTASSWKV 

KIIKENEKHQ 

NSGENKKEQK 

IGLLVIASGC 



EF087-3 ( SEQ ID NO:331) 
A ATCGGATGAT TCATCCACAG 

GAAGTGAAGC TTAACCAAGG TTCAGACAAA GATAATTCTT GGm. i~ 
ANTCCATTGA ATAACATTGG CTATGTTTTT CCTAATTINA vG^-il.-* 
ATAGAAGAAC GCTCTGGTCG CTACGGAGAT ATTAACGAAT aC'.V T-- 
TATACAAATA CATTTGCTAA AATTAGTAAA AATTATGGCA AGA-T, 
TACGAATATT TAACAGTGGT TGGGAAAACG AATGAAGAAA TJ,-A, 
AAAGGCTATA CTGTTCTAGA AAATACAGCA AACTTACAAG CCAn. 
GTCATGATGA ATACATGGAA TAATGACCAA GAAATTGCAG GA/ iG, 
ATGTCGGTTA TTTCAGAAAA AATTGATAAC GGTGTTTATC GCVTA, 
TTACAAAATA ATGCATCC 



:"'TT AAGCGCAGCG 

:gc;t tgatgttcaa 
:caa tgataaaacc 
:tga aaatggtact 

;TCT TTCTAAAAAC 
A\GC AGGTAATTAT 
ATGC GTATGATCCA 
"""rCT TGCGAATCCT 



EF087-4 (SEQ ID NO:332) 



NRMIHPQE 

VKLNQGSDKD NSWISLSAAX 
TNTFAKISKN YGKTVENGTY 
MMNTWNNDQE IAGLYAYDPM 



PLNNIGYVFP NSMNTLDV^I 
EYLTWGKTN EEIAALSKHK 
SVISEKIDNG VYRLTLANPL 



EERSGEY GDI KEYFVNDKTY 

gy7vle::tan lqaieagnyv 
;)IjNas 



EF088-1 (SEQ ID NO:333) 



WO 98, 50554 



PCT/US9S/08959 



ISO 

TABLE I. Nucleotide and Ammo Acid Seqeueiues 



TAACTGGTGG 
CA7GATGACC 
CTGTATGCAA 
ACAGA7ATTT 
GTACAAGTAC 
TATCC7GA7G 
GAGTTGCTAA 
AATGACCCTA 
GTAAATGGAA 
AATCCTTTTA 
GCAAAA77TG 
CAACAAAGTG 
GACTTGAAAA 
AA7GTATATG 
AG7ATGTATT 
TGGCATACAG 
GGATACTGGG 
T7GGCAAATG 
AA7GGACAGG 
GTTGCTAAAA 
ACTGGTACGA 
GAAGTGAAGC 
ANTCCATTGA 
ATAGAAGAAC 
TATACAAATA 
TACGAATATT 
AAAGGCTATA 
GTCATGATGA 



G A Z 1 7 G G C r^M>. 
TATTGAATAC 
AAGATCCAAA 
CAATCACCG7 
AAGAAGCTGT 
GTTCCTTGAT 
AAGGGTTTGG 
ACATTAGTAA 
AAATGCCATC 
CTACAGAG7T 
CACCAGAAAA 
GGTCATACTA 
ATGTAGTGAA 
G77CGATGGA 
CACAACGTGT 
CAGACGGCAT 
CAACGATCGA 
GTGCTTATAC 
TTGCCTCTAT 
AATCTTGGTT 
CAGATGCTTC 
TTAACCAAGG 
ATAACATTGG 
GCTCTGGTCG 
CATTTGCTAA 
TAACAGTGGT 
CTG TTCTAGA 
ATACATGGAA 



77GG7TC CGC GC. 
A G A T' G C A G AA A^-., 
CATACAATGG CG 



TCCGTCCGT7 77AAAAAACG 
TCAAC A ~ ^ ! ^-T O-.T. - . 
AC G AATTC AG A C 7 A777 7 A A 
77TATT7AA7 G77^TAGA7A 
GA7GGT77GT GG7AGAAG7A 
TGAATC G GG7 AAA 3 AAA G A_A 
777AAG/v."^A7 ^^C^. - ^ - ^- - 
7CA77TGTT7 
7AGTGCG7CA 
7CGAGTCCTA 
CGG AAAC7A7 
GCTTTA777A 
7C CATA7CGA 77ACCAG: 
AGGGAAA ^GC Aoj. . ^ ... ^. ^ 
AGGAA1 G77T 77AGA7A- 
C77A7TAGAT GG7CAAA' 
GA7TGAAACA ATCCTCA. 
T7 GAG AC AAA GA7AA77 
C7A7GTTTTT CCTAAT7 
CTACGGAGA7 AT7AA.CG 
AAT7AGTAAA AA77ATC; 
TGGG AAAAC G AATGAAG 
AAA T A C AG C A AAC 77 AC 
7AATGACCAA GAAA77G 



1 J^v, x v 



c-aat; 



TG'GC A 



ATGTCGGTTA TTTCAGAAAA AATTGA7AAC ^GT~ 



TTACAAAATA 

GCGGACCCAG 

GGCAGCTCGC 

AAATT AATTC 

GTCTACAGCG 

GCAGAAGTAG 

CCAACTAAAG 

CAAGAAAAAG 

CAAACTGTGG 

CTACGTTCGG 

AAAAATGGGG 

AAACAAGTTA 

ACAGGAGAAA 

TGTCTTTTAG 



ATGCATCCGT 

AAATTTCTGT 

GTTCAATCAT 

AGGAACAAAA 

AAGCATTGAA 

ACCAAGCAGA 

AAGTAGATAA 

ACTACACCGC 

CAGATCAAAC 

CAGTGAAGCG 

GGAATAATGG 

AGCCATCAAG 

AGAAATCGAT 

TTTT7CGTAA 



TTCTATTGAA 
TGACCAAAAT 
TGTTAAAACA 
AGAACACCAA 
ACAAGCACAA 
AACAGAGTTA 
AACCAACTTG 
AAGCAGTTGG 
AACAGCAACG 
ATTAACAT7G 
ACACTTAAAT 
CCAAGGTGG7 
CGCGCTTGTG 
AAGTAAATCG 



TTT GATAA:. 
A7TA7:AC" 

act:ctgaa 

G AAiAA A G 7*. C 
ACTG7GGC; 
CG7TCGAC, 
TTGAAAA7* 
AAA 1 
CAA 
AAAAA7AG 1 

acta:-ta:. 

TTCA j AAA. 
AT7AT7G I 
AAGAAGTA 



CA 



ATCAA. 
1' G AaG 
7 C A/\A. 
GTGAA 
TAG AC 



'G GGGAJ 



aa7: ga7777atta 
; • - 7cagctgatg 
:aaa ag:taactta 
, :aa t:aacgccta 
: : 7aatggctta 
• • : t7aggggaa.t 
:a;ga cr jggagatg 
\::77 a:aattgatg 
cagc g:gagaaacg 

A 777 AACCTTAATT 

: ;;ca a a ;gtggctt 
77ga agcgttaatt 

CAA7 GCAA7CTTTA 
A 7 G C GC-7CGGGATC 
:a;A 7AAAAAAGGC 
C7CA G7T7GATGAA 
77 A A CAGAAGAGAA 
: A A G 1 G A C TCAAAT 
AAG..„- AATGAACTTA 
;. AGG AAGTGGCATT 
7GA7 TC ATCCACAG 

a 777 aagcgcagcg 
.cact 7gatgttcaa 

■77AA TGATAAAACC 
A ! AAA AA.ATGGTACT 
;C7CT 77C7AAAAAC 
;A.AGC AGG 7AATTAT 
7-.7GC GTA7GATCCA 
,. iGT G'GCGAATCCT 
?~TCA ACTAGTCGCA 
,C7AC GGGGTTAAAT 
AiA.-A ACCGTTAGAA 
:CAAG CAGCTGGAAA 
\ : ; .AC AG C AACGCAA 
AAAT7 GGTAAAAGTG 
3 A AAA GGAGAAACAC 
:-CATT GAAGCAAGCG 
:AAGC AGAAGCAAAA 
AA7AA AAA GG AG CAA 
AA7CA AAC7GGTACG 
G/J-.77 T77ACCGAGC 
:~7A7 C3C CAGTGGG 



EF08S-2 { SEC ID NO:334} 



LVGLANWFRA ALTDTLILLH LDLLN7DAEK LNKF7APLML YAAGA 
DISITVLGTG LLLEDNQRLV CjVQEAVPSVL KS\ : FSGGGAY 

LLKGFGRIQT ILQGSDWEMN OPNISULFNV VDKGYL£LMV AGKMF 

PFTTEFESGK ETIANLTLIA KFAPEIiLRND I YTS IC.7V7.Q ;:£GSV 

LKNWNSASP AQATPMQSLN VYGSMDRVLQ KtJNEYAVGIS MYSQR 

HTADGMLYLY NQDFAQFDEG YWATIDPYRL PGTIVGTREL ANGAY 

GQVASIGMFL DKSNEGMNLV AKKSWFLLDG IULGSGIT G77DA 

VKLNQGSDKD NSWISLSAAX PLNNIGYVFP NSMKTLC'AI 77 ASA 



A: CAP IYRATGANLT 

AV HAY FPYNGSYGNE 

^lAASG RSISRAPETN 

i'HFFK KPPDFEALID 

;gi:ye fgntenkkgw 

GGARS PwSWVGGSNN 
SI77I LDIIRMIHPQE 
A AC A i MEYFVNDKTY 



WO 98/50554 



PCT/US98/08959 



181 

TABLE 1. Nucleotide and Amino Acid Seqcuences of E. f'uc\vulis Genes. 

TNTFAKI SKN YGKTVENGTY EYLTWGKTN EEIAAL3ENK GYTVLEI :TAN LQA1EAGNYV 

KMNTWNNDQE IAGLYAYDPM SVISEKIDNG VYRLTLANPL QNNASV;" IEY DKGILEWAA 

DPEISVDQNI ITLNSAGLNG SSRSIIVK7T PEVTKEALEK LrQEQKFHQE KDYTASSWKV 

YSEALKQAQT VADQTTATQA EVDQAETELR SAVXQLVK V? TKEYDKTNEL Y I IKENEKHQ 

EKDYTASSWK VYSEALKQAQ TVADQTTATQ AEVDQAEAKL RSA^RETLK NSGENKKEQK 

NGGNNGHLNT STGVDQTGTK QVKPSEQGGF RKASQFLPST GEKKE " ALY Z 1GLLV1ASGC 
LLVFRKSKSK K 

EF088-3 ( SEQ ID NO:335) 

A ACTCCTGAAG TAACGAAAGA AGCGTTAGAA 

AAATTAATTC AGGAACAAAA AGAACACCAA GAAAAAGACT AGACCGC AAG CAGCTGGAAA 

GTCTACAGCG AAGCATTGAA ACAAGCACAA ACTGTGGCAG ATCAAACAAG AGCAACGCAA 

GCAGAAGTAG ACCAAGCAGA AACAGAGTTA CGTTGGGGAG TGAAGCAAT7 GGTAAAAGTG 

CCAACTAAAG AAGTAGATAA AACCAACTTG TTGAAAATCA TCAAAGAAAA CGAGAAACAC 

CAAGAAAAAG ACTACACCGC AAGCAGTTGG AAAGTCTACA GTGAAGCAT7 GAAGCAAGCG 

CAAACTGTGG CAGATCAAAC AACAGCAACG CAAGCAGAAG TAGACG7.AGC AGAAGCAAAA 

CTACGTTCGG CAGTGAAGCG ATTAACATTG AAAAATAG7G GG ^AAAJ\TAA AAAGGAGCAA 

AAAAATGGGG GGAATAATGG ACACTTAAAT ACTA I4TAC AG GA ^ TTGATCA AACTGGTACG 

AAACAAGTTA AGCCATCAAG CCAAGGTGGT TTC A ^AAA-AG :TAGCC7.A7T TTTACCGAGC 
ACAGGAGAAA AGAAA 

EF088-4 ( SEQ ID NO:336) 

T PEVTKEALEK LIQEQKEHQE KDYTASSWKV 



YSEALKQAQT 


VADQTTATQA 


EVDQAETELR 


SAVKQEVKVP 


TKEVDK 


i'NLiL 


KI IKENEKHQ 


EKDYTASSWK 


VYSEALKQAQ 


TVADQTTATQ 


AEVD^AEA-RL. 






NSGENKKEQK 


NGGNNGHLNT 


STGVDQTGTK 


QVKPSSQGGF 


RKAS.)FLP2T 








EF089-1 (SEQ ID NO:337) 










TGACAGATAC 


ACCTGCTAAC 


ACAGGAAACT 








AAGATCGGGA 


TATAGGTCAA 


AAATTTTTTG 


GCTTATCTTT 


CGGT-TTTTG 




ataa 


TACAACAAAG 


AATGACAGAC 


ATAGGAGAAT 


GAATATGAAC 


AGATGGAAAG 


TATATG 


::aac 


GGTAATCGCT 


TGTATGTTAT 


TTGGCTGGAT 


TGGCGTGGAG 


GCGCACGCTT 


CTGAAT 


:ta.-\ 


TTTTGCGGTC 


ACACCAACAA 


TTCCCGAAAA 


TCAAGTGGAT 


AAATGAAAAA 


CCTACT 


TT3A 


CTTAAAAATG 


GCGCCTGGTG 


CCAAACAAAC 


CGTAGAAATT 


CAGTTACGCA 


ATGATA 


CAGA 


TGAAGACATT 


ACCATTGAAA 


ATACGGTGAA 


CTCAGCGACA 


ACAAATTTAA 


ATGGCG 


TAGT 


AGAATATGGC 


CAAAACGGGA 


TCAAACCTGA 


CAAAACCTTA 


CGTTTTAACT 


TAAAAG 


ATTA 


TGTGGAAGCA 


C C G AAAG AAA 


TCATCTTGCC 


GAAGCATTCC 


CAAAAGACCT 


tac:tr 


TAAC 


CATTACGATG 


CCTAAAGATT 


CTTTTGATGG 


CGTGATGGCT 


GGCGGTATAA 




AAGA 


GAAAAAGAAA 


GAAACAACGA 


CTTCTGCGGA 


TCAATCAAAA 


GGGTTAGCTA 


TTAATA 


ATGA 


ATACTCCTAT 


GTTGTGGCTA 


TTATTCTTCA 


GCAAAATGAG 


ACAAAGGTTC 


AACCAG 


ATTT 


AAAATTACTG 


GGGGTTAAAC 


CAGGCCAAGT 


CAACGCGCGA 


AACGTCATCA 


atgttt 


CTTT 


ACAAAACCCA 


CAAGCGGCCT 


ATTTAAACCA 


ATTACATTTA 


ATCAACACT3 


ttt-:aaaag3 


AGGCGAAACG 


CTTTACCAAT 


CCGATACTGA 


GG AT A TG C AA 


GTGGCGCCAA 




ACTT 


TAGTTACCCA 


ATTTCTTTAA 


AAGGGGAACG 


ATTAACGCCA 


GGAAAATAT3 


t::ttga 


A.ATC 


AACGGCCTAT 


GGTGTAAAAG 


ATGAAAAGGG 


CACCTATCAA 


GTCAAAGGC j 


ccaatg 


GTGA 


AGAACGGTAC 


CTGTACAAAT 


GGGAATTTAC 


AAAAGAATTT 


ACTATTTCT j 


gggacg 


TCGC 


TAAAGAATTA 


AATGAAAAAG 


ACGTAACCAT 


TAAAGGAACC 


AATTGGTGGF 


tgtatc 


T A Z T 


GATTGCATTA 


ATCATTCTAG 


CGCTGCTCTT 


ATTGATTTTC 


TTCTTGT7l'["2 


gtaaaa 


;,gaa 


AAAAGAGGAA 


GAACAACAAT 


CTGAGCAATA 


A 











EF089-2 (SEQ ID NO:338) 



WO 98/50554 



PCI Vi:S98/08959 



is: 

TABLE 1. Nucleotide and Amino Acid Seqeucnces ^: 



12:? vrFVYATviAc 

>:lfgv;:gvea kasefffavt ptipengy— sftyf::f: 

IENTYKSATT NLNGVYEY'GC FGIKFFFTFF FFFFIFF'F.' 

kdsfdgvmag gitlkfkkf.e ttt£ ac cjs kg lai ffff f" 

\~fpgqynark yikysf:;nfc. aaylfffhf: ::tvfeggf": 

elxgerltpg kyvlkftayg ykfexgtytf fgafgfff\ 

e?;dvtikgtn wwlyluali ifallfliff lyrfffff: 



; . FFFFTFEDIT 
FTFPFT1TMP 
;7 FV~ PFFKLFG 
F' AI-'FFNFSYFT 
:'T Z SGFVAKELN 



EF089-3 (SEQ ID NO : 3 3 9 ) 



T CTGAATTTAA TTTTGCGGTC 
ACACCAACAA TTGCCGAAAA TCAAGTGGAT 
GCGCCTGGTG CCAAACAAAC CGTAGAAAT7 
ACCATTGAAA ATACGGTGAA FTCAGCGACA 
CAAAACGGGA TCAAACCTGA CAAAACCTTA 
C C G AAAG AAA TCATCTTGCC GAAGCATTCC 
CCTAAAGATT CTTTTGAT3G CGTGATGG-T 
GAAACAACGA GTTCTGCGGA TCAATCAAAA 
GTTGTGGGTA TTATTGTTCA GCAAAATGAG 
GGGGTTAAAC CAGGCCAAGT CAACGCGCGA 
CAAGCGGCCT ATTTAAAGCA ATTACATTTA 
CTTTACCAAT CCGATAGTGA GGATATGCAA 
ATTTCTTTAA AAGGGGAACG AT 



CG r ' 
CA, 



gf: 



ayt;;' 



CTTAAAAATG 
TGAAGACATT 
AGAATATGGC 
7GTGGAAGCA 
CATTACGATG 

gaaaaagaaa 
atagtcctat 
aaaattactg 
agaaaaccca 
aggcgaaacg 
tagttaccca 



EF089-4 (SEQ ID NC:34C) 



SEFNFAVT PTIPENQVDK SF'TYFDLKMA PGAXQTV: 
TTrv-rn/NSATT >3LNGWEY GQ NCIKPDKTFF FFLFF' 
KDSFDGVMAG GITLKEKKKE TTTSADCSX3 FA i FF' 
VKPGQVNARN VINVSLQNPQ AAYLNQLHLI NTVSF- 
SLKGER 



FTFPLTITMP 
KVQPDLKLLG 
APNSNFSYPI 



EF090-1 (SEQ ID NO: 341) 



TAGTCTCTAA 

GAAATGCAAA 

ATTATTACTT 

AGTCACTTTT 

GCTGCAGAAG 

ACAAAAGAAG 

GACCATTTAC 

GAAGCCAAAT 

CGTCGAGGCA 

CAGCGCTTAA 

AATGCCAAAG 

ACATTGACAA 

AAAGTAGCCT 

AAAGAAAAAC 

CCAGTTTTAT 

GGCTGGTCGA 

GAACAAACGA 

GCCAATAATT 

AAAAAAATAG 



GAAATAAACC 

TGCGTAATAC 

CTTTGGTTGG 

TNCCAACCAC 

AAAAATTAGC 

AAAAAATTCA 

ACAGTAGCAA 

TAGCAACTTT 

ATGGCACTTT 

ACCAGCAGAT 

ATTTTTATAA 

CGCTCAATAA 

TTGATAAAAC 

TAACTACTTG 

TTACAGATGT 

TTGATGGGGC 

ATGCAATCAC 

ACATTGAAAT 

TCACCACAGA 



TAAAATTATT 

ACGTCGTCAA 

ACTAGCTCTG 

AAAAGTAAAF 

GCAAGTTAAT 

ACTTCCTAAA 

GGTGAAGGTA 

GAGTTTTCCA 

TGAAATTGTT 

TATTGCGGAT 

AGCCCCTGAA 

CAAGTTAAAT 

ACAAATTCAA 

GGTGACACAA 

TCACGGCACG 

CAAAACGCAA 

TGCTCCGTTG 

TGATTTAAAA 

TGTCATTAGT 



G / * TAT AAA v 
AAA7CAGG* 

gttggtgg; 
ggagtttc 

CAAACCGAj 
AAAYAGC.* 
CCGT I'M-iA.*- 1 / 

gaggggaa 
cccgaaga 

GTTGA-^G^ 

attafaaa. 

AAAACAAT 
AACGTGCT 
TTAGAAAC 
ACACGTG' 

gaagtagf 
gttggtg; 

GGCAC-AT. 



;G ATGAAFAAAT GAAAAAAGAA 
.A A AAATAATAA AAAG AAAG TA 
)G GCAGTTATGT TTATTTTCAA 
G TAGGCTG "TT AAATGTAAAT 
..G AA.GTYGT jGT TCAAACGGGG 
,t tg:-atgaaaa ATTTTTAAAA 
:g ag 3c att gaa aaaagaacta 
,c caagcaa.aaa tgcgagtatc 
,c aaggcatagt agtggacaca 
jG gaaaagggaa ctatcaatat 
\g aggatgaaac gttaaaggca 

: A C AG T _ 3 A TAT T AATGGTG AA 

^a atgatf/tgg cacaatcaac 
fatggttc tgctaatcaa 
^aaaacaa gggaagttat 
■■■acgggct 3aatagccaa 
aaa 3 aaaa t ag t aaaatt 
:at". f-ttt cattgatggc 
aaa^aac cggaacagta 



WO 98/50554 



PCT/US98/08959 



1S3 

TABLE 1. Nucleotide and Amino Acid Seqeucnccs of E. jaccuU^ Ocnes. 

CCAGGATTCC ATACAATTTT ATATCGGACA A:CGA7GTGA AV. TTA GAAGC TCAAATGCTT 

GATGGTTCTC GATACAGTGT GCCAGTAAAA '1 A77>.- - A7 -G CG77A':~~7,AG T7AAGGGGGC 

GTTGTCACAC AAATCGGGAT TCATGACTCC GATCATAAAT T7GATGGGTA TGGCGATAAA 

GAAGCGTTTA AAACCGATGC TG G T A G T AA T GGTTGTATGA a:A7GGCAG'G AACAGAAGTT 

TCAAAAATCT TTGATGTATC CTATGACGGA ATGC GC 3TAA T r : ATT7A7GG ACATATCTAT 

GATGATGCAC CAGGTGAATT TGATAAACCT GTAGATTACG GCGAAGAAGP ATAA 



ef090-2 (seq id no:342) 

mrntrrqk sgknnkkkvi itslvglalv aggsyvyfqs 

hfxpttkvng vsvgwlnvna aeeklaqvnq teewvvtct kehkg ?: ■pkk yqldqkflkd 

hlhsskvklp lneafkkele aklatlsfpe gkpsknasir j-cngtfeivp eeqgtwdtq 

rlnqqi iadv eagkgnyqyn akdfykapei tkeg pglx/gg lttl77j] *.1 ,1 1?; titvdingek 

vafdktqiqn vlnddgtink eklttwvtql ettygg a7g p vi etdyhgt7 rrfknngsyg 

wsidgaktqe llvnalnsqe qtnaitaplv gdtkehskia ig'yiigiggkd qkmycfidgk 

kivttdvitg rynkgtatvp gfhtilyrtt dvnlhgqhld gggyg' 7"' 7gy wgipllsqggv 

vtqigihdsd hkldkygdke afktdagsng c1ntpg7evs }'. 3 7 d7s\ ggm pviiyghiyd 
dapgefdkpv dygeev 

ef090-3 (seq id no:343) 

cac aaaagtaaat ggagtttctg taggctggtt j-j\s\ t g t aaa t 

gctgcagaag aaaaattagc gcaagttaat caaac7gaag aagttg7ggt tiaaacgggg 

acaaaagaag aaaaaattca acttcctaaa aaataccaat tg gatcaaaa atttttaaaa 

gaccatttac acagtagcaa ggtgaagcta ccgttaaacg a ; >ca g7caa aaaagaacta 

gaagccaaat tagcaacttt gagttttcca ga3gg " ? aaac caa3c777aa tgcgagtatc 

cgtcgaggca atggcacttt tgaaattgtt ccggaaggg c aaggcacagt a 3tggacaca 

cagcgcttaa accagcagat tattgcggat gttgaagcgg gaaaaggcaa ctatcaatat 

aatgccaaag atttttataa agcccctgaa attacaaaag a'gatcgaac g t taaaggc a 

acattgacaa cgctcaataa caagttaaat aaaa 77a77a ca 3jttgatat taatggtgaa 

aaagtagcct ttgataaaac acaaattcaa aacg7gctga atga7ggtgg cacaatcaac 

aaagaaaaac taactacttg ggtgacacaa ttagaaacaa catatggttc tgc taatcaa 

CCAGTTTTAT TTACAGATGT TCACGGCACG ACACGTC jTT TTAAAAA.CAA CGGAAGTTAT 

GGCTGGTCGA TTGATGGGGC CAAAACGCAA GAACTAC7AG TAAAC3C 3GT GAATAGCCAA 

GAACAAACGA ATGCAATCAC TGCTCCGTTG GTTG 3TGATA. 7 TJAAAGGGVAA TAGTAAAATT 

GCCAATAATT ACATTGAAAT TGATTTAAAA G A T C A A A AAA TGTATTGT7T CATTGATGGC 

AAAAAAATAG TCACCACAGA TGTCATTAGT GGCA3ATATA ACAAAGGAAC CGCAACAGTA 

CCAGGATTCC ATACAATTTT ATATCGGACA ACCGAT3TGA ATTTAGAAG3 TCAAATGCTT 

GATGGTTCTC GATACAGTGT GCCAGTAAAA TATTGG-ATG C C 3TTATPAAG r PCAAGGGGGC 



GTTGTCACAC AAATCGGGAT TCATGACTCC GACCA7AAAT TV3A7AA3TA TGGCGATAAA 

GAAGCCTTTA AAACCGATGC TGGTAGTAAT GGCT3TATCA A7ACGCCAG3 AACAGAAGTT 

TCAAAAATCT TTGATGTATC CTATGACGGA ATGGC 3GTAA 7 IATTTATG 3 ACATATCTAT 

GATGATGCAC CAGGTGAATT TGATAAACCT GTAGATTACG GCGAAGAAGT AT 

EF090-4 {SEQ ID NO:344) 

TKVNG VSVGWLNVNA AEEKLAQVNQ TEEWVQTG7 KEEKI KG YQLDQKFLKD 

HLHSSKVKLP LNEAFKKELE AKLATLSFPE GKISKHAG IR FG7G77 7 I VP EEQGTWDTQ 

RLNQQIIADV EAGKGNYQYN AKDFYKAPEI TKEDQTLKAT LTTLKNKLNK TITVDINGEK 

VAFDKTQIQN VLNDDGTINK EKLTTWVTQL ETTYGG A 77"' P VLFTD7GGTT FRFKNNGSYG 

WSIDGAKTQE LLVNALNSQE QTNAITAPLV GDTKENSKI A MHYIEIDLKD QKMYCFIDGK 

KIVTTDVITG RYNKGTATVP GFHTILYRTT DVNLEGQ'-G.jD GSKYGV l-'VKY WMPLLSQGGV 

VTQIGIHDSD HKLDKYGDKE AFKTDAGSNG CIUTFGTE7S K;77777GGM PVIIYGHIYD 
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TABLE 1. Nucleotide and Ammo Acid Seqeuer.;^ . L ■ : enes. 



'^EFDXPY ZYGEEL 



efo; 



( SEQ II 



T AA.T - GG NGG AG AT - TT .AT G C TAA.-w-.r 

GGAACAGCAG CAGCCGTTGC CGCAT ^ -^.TT ,~ 

GAT3ATTTAT G AAA TC AAA C A3ATGATTT; 

GCTGTTCAAA AAGGAACAGA ATTAA^^C-.^r 

gatcaagcct ctgatttggc aggttttgtc 
gcacaaggtg tttctggcga catgcttc-al 
gatgaattta aaaaagcagc tgacgatocl 
ggc jAAGats cagcagaaga tatctatat: 

GAAAGTGTTT CTGCTGGTGT CGATGAAGC; 
GCTGCAGAAG C AAAAG AAG A TGTTAAAC-AT 
GGGTAA 



_-vAA 



A G T A A T TG G T 
AGAATTACGT 
GACAGATTAT 
CG7TTTATCA 
ATTGGATAAA 
TGATTTATCT 
AGGT3AAATT 
AGCGGCCAAA 
TCCT3AAAAA 
A3AATTTAAA 



EFC91-2 ( SEQ ID NO:346) 



MAKKG GFFLGAVIGG TAAAVAALLL APKSGKEI 
VQKGTELTEI AK.QKAGVLSD QASDLAGSLLK ZK: 
QFKKAADDAQ DKAEDLGEIA EDAAEZIYIZ Vr" 
AEAKEDVKDA AKDVKKEFKG 



KAQDYTDYA 

LDK FKKQTGDLSD 

SAX ETTKDVPEKA 



EF091-3 (SEQ ID NO: 347) 



at caaatcaaac agatgattta aaaaacaaag 
gctgttcaaa aaggaacaga attaacagaa at: 
gatcaagcct ctgatttggc aggttctgtc aa; 
gcacaaggtg tttctggcga catgcttgat aa: 
gatcaattta aaaaagcagc tgacgat 3ct ca. ; 
gccgaagatg cagcagaaga tatctatatt ga: 
gaaactgttt ctgctggtgt cgatgaa3ca an; 
gctgcagaag caaaagaaga tg tt aaa g a t gc; 

GGGTAA 



AGATT. 
YCGG 
ATTC 
LAGG 
ATTT 
TGG 



CGTTTTATCA 
ATTGGATAAA 
TGATTTATCT 
AGGTGAAATT 
AGCGGCCAAA 
\TGT TCCTGAAAAA 
-AAA A 3 AATTTAAA 



EFC91-4 (SEQ ID NC:348) 



SNQTDDLK NKAQDYTDYA 

VQKGTELTEI AKQKAGVLSD QASDLAGSVK E?L 
QFKKAADDAQ DHAEDLGEIA EDAAEDIYID VK 
AEAKEDVKDA AKDVKKEFKG 



■la qg'\-:;::l:ldn fkkqtgdlsd 
le tysagl'zeax ettkdvpeka 



EF092-1 (SEQ ID NO:349) 



TAAGGGGATG AAGAAAAAAT GGCAAAAAAA AC 
AGCACGAGTT TATTAGTAAC AAAAATGCAA AA 
GACATCTTTG CAGTATCGGC TTCTGAAGCA GA 
GTTTTACTTT TAGGTCCACA AGTTCGTTTC AT 
CCAAAAGGGA TTCCTTTAGA TGTAATTAAC AT 
AAAGTTTTAG ATCAAGCAAT CTCATTAATG GG 



;ttc cgcaggaatg 
■tgg catggaagca 
,taa agaggtgaat 
--ac a aaaattacaa 
;gat gaatggcgaa 



ef092-2 (seq id nc:350) 

makkt imlvcsagms tsllvtkmqk aaedrgi- 



.vsasea: 



LrCLENKEVNV 
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TABLE 1. Nucleotide and Amino Acid Seqcuences -V Jaccaiis denes. 

lllgpqvrfm kgqfeqklqp kgipldvinm adyg:-vy:g yy \yv ; a v :,mg 
ef092-3 (seq id no:351) 
ag aagatcgtgg catggaagca 

gacatctttg cagtatcggc ttctgaagca g ata c aaa. '.'t tggaaaataa agaggtgaat 

gttttacttt taggtccaca agttcgtttc at- ; aaa. a' a • y aatttgaaca aaaattacaa 

ccaaaaggga ttcctttaga tgtaattaac at 3 gc ag att atggcatgat gaatggcgaa 

aaagttttag atcaagcaat ctcattaatg ggat 

EF092-4 (SEQ ID NO:352) 
EDRGMEAD IFAVSASEAD TNLENKEVNV 

LLLGPQVRFM KGQFEQKLQP KGIPLDVI KM AYYGXYVJGYK VLDQAIYLMG 
EF093-1 (SEQ ID NO:253) 

TAGTTTTTTT CCGATAAAGG GAGAATTTTA ATGAGC 
TTCACGGCCA TGGCCATTGT TTATTTGATG AGTYV.I 
CGTCAGTTAA CACAAAAATT TACGCAAGCT AC C CAY 
TTTCATTTAT TTCTTGCAGA TGTTAAACAA AATAG/ 
CTCGTATACG CGCAAGTGAC CCTCGATTAT ACA7A:" 
GTTTTATTAA ACAAATCTGG TCGAAAATAC CAA7A" 
AAAGCGGAAA CAATACTGGA ATAG 

EF093-2 (SEQ ID NO:354) 

M RQKYSGNLLF TAMAIVYLMS FLALQLLEER C^C\K~~Y AT QEYVAGKSIF 
HLFLADVKQN RRKLKTEERL VYAQVTLDYT Y? NEQ. ,R IVY LLMKSGYXYQ YQERVSHQKK 
AETILE 

EF093-3 (SEQ ID NO:355) 

CCTTCAGTT AC TAG AAGAA 
CGTCAGTTAA CACAAAAATT TACGCAAGCT 
TTTCATTTAT TTCTTGCAGA TGTTAAACAA 
CTCGTATACG CGCAAGTGAC CCTCGATTAT 
GTTTTATTAA ACAAATCTGG TCGAAAATAC 
AAAGCGGAAA CAATACTGG 

EF093-4 (SEQ ID NO:356) 

LQLLEER QLTQKFTQAT QEYYAGKSIF 
HLFLADVKQN RRKLKTEERL VYAQVTLDYT YKME;;VYVV LLYKY'^KYQ YQERVSHQKK 
AETI 

EF094-1 (SEQ ID NO: 357) 

TAAACATTTG AGACATTCAG AGGTGAATGT CTCTTTTTTA TYAC'IY .--.AAA ACGAAAGGGG 
ATTAATTATA TGAAAAAAAC AACATTTAAA AATTGGTCGT TATTTGCGAC TTTGGCTCTA 
TTAAGTCAAA CAATTGGCGG AACGATTG^T CCTA "G.AT'J'G CTTTTGCCGA TGAAATTACT 
CACCCTCAAG AGGTAACAAT TCATTATGAC GTAAGTAAAC TGTATGAAGT TGACGGAACT 
TTTAGCGATG GCAGCACGCT CTCAGAACGT ACTACGVCAT TATATGCAGA ATACAATGGT 
GCAAAACAAA CAGTATTTTG TATTGAACCA GGTGTYAYYA TTCCAAYAGA AGTGACGCAC 



;CA,\A AATATTCAGG AAACTTATTG 

. yvg cccttcagtt ac tag aagaa 
aga.at actatgvagg gaaaagtatc 
■ c g a a g t 7aa.a aac ag aag aaagg 
: a a .-" a a tg a a. ■: ■ a att aagaa taact 
;yaag AGAGAWTTC TCATCAAAAA 



ACCC ACGAAT 

aatagayg;v, 
acata::aaaa 

CAATATCAAG 



ACTATGCAGG 
AGTTAAAAAC 
ATGAACA ATT 
AGAGAGTTTC 



GAAAAGTATC 
AGAAGAAAGG 
AAGAATAACT 
TCATCAAAAA 
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TABLE 1. Nucleotide and Amino Aj: 



T G G 3 AAAA 1 5 G 
GnAGTGAAt 3 
AAATCTATTG AAGGAAAAA - 
TG. AAAAA C 
'TGAGTTTGA 
7GGAA7 1' AA77 AGIXS 7T 



CA7AATACCA 
TTAAATTTAT 



aaaaaatcag 

ATGGCTGGTG 



AATT7T; 
TkajG7 - '. 
TACT :7G 

aact:c, 

AAT 7GA' 



CTGGTACT7G 
CGCTTGATAA 



G 3TT - TTTAA AGATC AAAAA 
TTC 7ATTTAG ATTTTGGGAA 
ATTT7TATTT TGGATGGAAT 
CCAGATCCTT ATATGATTGA 
ATTTCCATGA CTTCG AAAAA 

gta3aaacag gtactgatc7 
att7gtaaag acagccca7 7 
cgt3cggaaa caccaaaaga 
gaaactaaat ctagtaatg 3 
tatg7Caatg aaaccgtggc 
actggggaaa ccactttgac 
gctgagttta aaggagctga 
tggagtgaag cttttaaaac 
actttggctt taga7gaaaa 
ttctggcaag aaaccaaagc 
tccatcaaaa aagttgataa 
aaagaacaag ttattcgctt 
actgccgaaa ctggatttaa 
gaaatcacag gtgctgaaga 
ggctatggta agtttgaaaa 
ccagaaggat ttcaaaagat 
gacgactatg cgaagagtga 
attaagatgg tgaccgttcc 
aaccgtttga tgc tttatg a 
tggaaagacg gaaataaaaa 
agatataact tgcatgaaat 
gaagccacaa aagctgccca 
acaaccgcaa cgttggcgaa 
accgctgaac aagttttgga 
gtagcctttg aagcaggcaa 
caaaccgtca attgtacgat 
gatggttcgc aaacttttac 
acccatgatg tactggatgg 
ccagatggta cgaacaaaga 
aaagaattta ccaaaaccgt 
actaagttta cttttacgga 
aatgaagatt tgaaagaaaa 
agtacgccaa aacaaccgga 
acagtgaaga cattcccgca 
tttatcttga ttttttcgac 



TAC7A7A:"77 7.TG77TG77A CC 

7AT77GG aa aa.-, . / ga tt 

TGGT2AAG.7T 7 77 7;_A7AAA TA 

GCTT377AA7 G 7777 GGA7,7 TG 

TCTT577AC7 AG7AA -M : 7 AA AA 

AAAA 3 AA 1 7 A C ; GG*. GAT A 7 7 G GT 

ATATAC77TC TT7AC7GC.GA AA 

AGAAT7A :;TG AAGGGAAGGA AA 

gaac-:aa7T7 gccgttaaac ac 
acctga^gaa :;g\:' :gg g; at 

TAAC3AAAGGi. AA7G7' 'GTAA '7'*' 
TGGC7777A7 TT 7 T77AGC.T 77 
CGAC7 r : 

taaag: 

rr>fp ' 

TACACl 

gtatg: 

ttacgagaaa cta- 

TTTGC 7 7' GAG AAA : 

ATTGAATACC GTT: 

CAAAGAAGAC 7GCG 

AGAAAAAGAG ^ AAAAA 7 'A A. A ' 

CAAAGA3AAA ACT7GAA77T GG' 

taaaag7atc gtct7g7t7a at" 
tgagcca77a gcgaagga7g c7- 
tgaacg:::a7 gtt7G7A7 7c aag 

TCATGGTGAC '-777: AG 7A 777 
CTCAAjt_'- i 1 G.'-i>\ 7 ■~7 1 77 .-^r.-: ,-^A 

aattt g g aaa tc7gg7aaaa 77; 
acttgcg3aa aaagtag.g7a c7c 
aatcaa7tac 3aaaag,gatg gaj 
atctcaaacg ttaa..ca::7aa aac 

AACACCAGCT 3TTCCAAGTA AT r 
AACTGGGGAG AAAAA777 7A AG; 
TGCTGGGTAT 7A77'7 7 . A7A A77 



s TG 7 
; AC 7 



:tti 



_a gt: 
;a cc: 

:G TA7 



\GAG 
7TCA 
7 TG A 



\TC 

r^A 



GV7A 

:ttc 
:ttc 

7AGT 
CAGC 

7TCT 
7 7TA 
7GAA 
7AGC 



'GGA 
7GTA 
7TGA 
7GTA 
7GAA 
"GCC 
7 AG A 
TACT 
7CAA 



; 7 .7 7 7 T G G G AA 
AG77GATATA 
AGCAAGTTTT 
T A AAAA TG AA 
T7ACCGTGTA 
ATTAACATTG 
TCAAACTGTG 
GGAAACTAAG 
AGAAACGGTT 
T7-AAGATGGG 
AG vAA T C G G TG 
GGGCGAGACC 
GAAGACTGGG 
TACATTTGCC 

tgaaaaaggt 

r r7ACGTGACA 
C7AGTTAAAA 
C 'AAGAAATT 
TJAAGGGAAA 
A J CTGTT AAA 
TG AAA C AG TG 
TAACGAGTAT 
GTATCCTGTA 
T 7T7ACGGCA 
G7C7GATGGC 

A ;ggaccaan 

AGGTTTTGAT 
,h."i7AGAGG CT 
G "AAAACAAA 
AAAACAACCA 
TGTTAGTCTG 
T7TTGCGACT 
TGATAAATTG 
CATTGATGTG 
GATTGCCGAA 
GCATAAATTA 
TGAAAACAAG 
CAATCAAGCA 
CCACCTAGAA 
TGTGTCGGTT 
TGCTTTACTA 
AGTGAATGAT 
TG CAGAAGGA 
7 3GAAAACAC 
AACCATACCG 
ATCTAGTCCC 
GTTAGTTGGC 
TTAA 



EF094-2 { SEQ ID NO:358) 



MKKTTFKN WSLFATLALL SQTIGGTIGF 71 
PQEVTIKYDV SKLYEVDGTF SDGST7SE7T 



:epg vsiptevthg 
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TABLE 1. Nucleotide and Amino Acid Seqcuuucs E. feccaiis Genes. 



YQKNPLPSKS DKAKLVSVLW 
SIEGKINKAI EEYQKKPSFH 
GNQLVLTPNS NSKSGTLTLK 
SLK IKKIDKE SGDIVPETVF 
DPYMIDTTPM AATIKAGETI 
RKD SPACE IV QEITTDEKGR 
ANQTVALVTS NVKGQNQEIT 
SEAFKTELVK GTKASDETVT 
IKKVDNNEKN AVITRDVTAK 
ITGAEDKATT ACNEQLGFDG 
DYAKSEYVFT ITEEGQKQPI 
KDGNKKLNTL DFTELVDKLR 
TATLANKEKT GTWKILHKLT 
TVNCTIERHV SIQTKAHLED 
DGTNKEIWKS GKIEHEVNDK 
EDLKEKSQTL TPKEVPTIPS 
ILIFSTAGYY FWNRRN 



ekagti I def igg- iee e. ge 
nttvktilGv' eye:, 
ksagtgtpva yfi i eye 'i: 
hldfgfaeeg kfyt': efeg: 
smtsk i imk ■:;■>; g;< ] leeei gv 

AETPFFLAFA LELGYY YYTE 
GETTFTFEEF DTGFEG^'GEA 
LALDEFI J 0 1 / A VE H L A I II E E F 

eqvirfofee ffea. --fa: 

YGKFENLPYG DYLLEEIEAP 

kmvtvt-yefe efee • : \v ; .n 
ynlh e : ke ew Y' a r a e a i e y e 
aeqvldfe : v LFrrAE/EFFY 
gsqtfthg: :*■ • effete" ~e it 

EFTKTI I j A EF 1 7 1 EE ; K Y P E GT 
TPKQPETPAY PFFGEEGAI T 



egg g e eh s if rlggasvdik 
fegeee g \g7 (, > ntanidyrvi 
aga_:eeyity aikinvetkg 
s ;:..f:-' ; ^iir fvtiteksvp 
etgtelvjndn yslagntfai 
tkf g eg gvmt fkptkvelky 
efkgaeytlf takdgqavkw 
WQFTKAPEGY tldetkypvs 
A FTI Fi J OFF F F V S PLEGTXE 
EGFHFITPLE IRSTFKENKD 
RLjI :Y FLPEF. EDSLTSLATW 
ATFAAGEKDE FAKPWIAET 
A F E; v G !•] E P V A F DASLNNQAQ 
FGEI EGGKFA FETILYALLP 
KFG''~TEII JYE EDGIJVNGKHN 
vf^'" etgek IJSNVLLLVGF 



EF094-3 (SEQ ID NO:359) 



CGA TGAAATTACT 
CACCCTCAAG AGGTAACAAT 
TTTAGCGATG GCAGCACGCT 
GCAAAACAAA CAGTATTTTG 
GGTTATCAGA AAAACCCTTT 
TGGGAAAAGG CTGGAACAGA 
GAAGTGAACG GTTATAAACT 
AAATCTATTG AAGGAAAAAT 
CATAATACCA CTGTAAAAAC 
TTAAATTTAT CTGAGTTTGA 
ATTGGGAATC AATTAGTGCT 
AAAAAATCAG CTGGTACTGG 
ATGGCTGGTG CGCTTGATAA 
GGTTCTTTAA AGATCAAAAA 
TTCCATTTAG ATTTTGGGAA 
ATTTCTATTT TGGATGGAAT 
CCAGATCCTT ATATGATTGA 
ATTTCCATGA CTTCGAAAAA 
GTAGAAACAG GTACTGATCT 
ATTCGTAAAG ACAGCCCAGC 
CGTGCGGAAA CACCAAAAGA 
GAAACTAAAT CTAGTAATGG 
TATGCCAATC AAACCGTGGC 
ACTGGGGAAA CCACTTTGAC 
GCTGAGTTTA AAGGAGCTGA 
TGGAGTGAAG CTTTTAAAAC 



tcattatga:: tiaa ieaaag 
ctcagaaggt acta i iteat 

TATTGAAG ::A IGTGT'EAG'TA 

gccatgaati tgtgataaag 

tattgata::a aatatggteg 

ccattigagag a^iaa iaeeag 

taataagg :a aiyiaeigage 

aattttag i e ■: aat : a .--.a. 

taaagtggee: iaIvAaeaggg 

tactccaam tctavttcca 

AACTCCAETC GCTTATAAAA 
GCCCAATA EI TACG A PA' TEA 
AATCGATAAA GAATIAGGEG 
AGCTTEACGT TIAAAAGATG 
TCCCCATGGT AGAAAGGTAA 
TACCACACIC ATGGCEI-CCA 
TATG C G A C AA AAA I I E I AAA 

ttggaatgai aateaeegeg 
tggtgaaatt geccgaga/,a 
gcttg2taat giegegia.ii 

TTTCGTGAAT A I IE ! I AAE% I 

TCTTG TEA G I A IE A A G GI ' I A 

AAAAG AAG A I AAAGACAC.:i 

ATATAGTI I I TETACEGCAA 

AGAATTAGEG AAGGGAAC IA 



TGT \TC AAGT TGACGGAACT 
TAEAEGEAGA ATACAATGGT 
ETC IAACAGA AGTGACGCAC 
CG,EAA ~ TAGT ATCGGTTCTT 
i": A! AAA A AG AT GATTTGGGAA 
GTG II EETTC AGTTGATATA 
EE :: AAAAAAA ACCAAGTTTT 
CT r : 1 . a ' I AG A T AAAAA TG AA 
CGAACATAGA TTACCGTGTA 
AAT IAGGAAC ATTAACATTG 
AAG IAGGACT TCAAACTGTG 
A AA ' EE AE \ T G T GGAAACTAAG 
ATATEG TAG C AGAAACGGTT 
TG A IAACAGA TAAAGATGGG 
GTATIIACTGA AAAATCGGTG 
CCATEAAAGG GGGCGAGACC 
TT I TEE' J 1 AG A GAAGACTGGG 
TA 1 1 EGG AAA TACATTTGCC 
TAG* 17-Gv"! IGA TGAAAAAGGT 
TGI IAAGGTA TTACGTGACA 
C A. A I A A. A A. GT CGAGTTAAAA 
A A - 1' ; I .-AAA CGAAGAAATT 
GTAATGAGA3 TCAAGGGAAA 
AAI AEGGTIA AGCTGTTAAA 
AAGGTTCTGA TGAAACAG 



EF094-4 { SEQ ID NO:360) 



DEITH 

PQEVTIHYDV SKLYEVDGTF 

YQKNPLPSMS DKAKLVSVLW 

SIEGKINKAI EEYQKKPSFH 

GNQLVLTPNS NSKSGTLTLK 



SDGSTLSFr E TSEEAEYF IA 
EKAGTL ILIAC IF/AQKE IEEE 
NTTVF TIL G G GTE E I E V. ; ■ : E L 
KSAGTGTPVA YKFAGE I I"IH 



FO'TYFC IEPG VSIPTEVTHG 

E'TIIYFEEGIK RLGGASVDIK 
FI GIG -' -" NTANIDYRVI 

AGALEEEFTY AIKINVETKG 
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TABLE 1. Nucleotide and Amino Acid Sjolv-j:-.. j. . ,.-cnes. 



ELKIKEIEKE SGEIYPETYF HAYFGrAEPA 

cpykiettpk aatikaget: s:-:zski^irqy. 
kkdspageiv qeittaekgp aetpkelatca 
anqtvalvts ^atkgchqeit getteteeef 
seafkteeyk gtkasdet 

ef095-1 (seq id no: 361 ) 

taagaattgt tggattgttc tttagaaaga 
gaattgatag taacgggcat ctgccatata 

GTTTTTGCAG AAA C ATT AT C AAGTA'^AA>^. 
AC AG C AG AAA AAGCCGAAAG TGAACAACCA 
ACACTGGCAT TGTCAAAAAG TGAGTTAATC 
ATT AG AG AAA GAATTGAGAC GTCTAACCTA 
GGGCAGCCAG TAAACGCCAA TGAGATCCTT 
CCAGATGGCA T AAA TGTGTG GGAAGGTGAA 
AATTTAAAAG AAGTGGTAAT TCCAACTGAG 
GTGCTTGCAG CGAGTAATCA AACATTTTTT 
TACAATAAGA AAGGGGAAAT TGATCCCAAT 
GGAAACCAAT ATCCAACAAC AATTTCGCAA 
TATAGTCAGA AAACAGGAGT AACGTTTAAC 
TTGTACAACC AAGTGAAGGT TGATTCATCG 
TTTTCAGGGC CGGTTTATTA TCATGTTACC 
ACTCAAGGGA AACCAATCCC TCCACCACCG 
GAGCGTGACC CTTACACCTT T AAA C AG AAA 
TCAAAAACGT ATCAATTTCA AGGATGGTAT 
AAAAGCGTAA CGCCCAGTTA TGATATTACY 
TATAAGGAGA TACCTCAAAA AAATTATACA 
CC AC CATC TG ATTTTATTCA GGATCACCAA 
TTAGCTGGAA AAAAACTGCC ACAACAATAC 
GGTTGGTATC AAGATAAAAC NAAACAAGAG 
TCCCCTGTTT TTAATGAAAT GAACGCTATT 
GCTGAAATGC AAATAGAAGG ACTAGTCAAA 
CAGATTATGC TTACAAATGT GGGAGAAGTA 
AGTGGTTGGT CACCAGGTCT AGCTCGGCCA 
CCAAACAAAA TTGTTCCTAT TACTGATGAA 
GAAGTGCCTA TTGGTCAGAC AGCAACTATT 
GATCAAGTGT TACAAGCGGC TGTTGAAATG 
GATACTGTCA GAATCCAACC TAAAAATCAA 
ATCAGCACAC CAACTTTTGA TTTTGGCAAA 
GGTTTAAAGC AGGCAGCAGA TTATTATGAA 
AAAAAATCAC AACCCAATTG GGCACTAACT 
GATCAACTAT CATCAATGAC AAAGTTATTG 
CAGTACAATC AACCAACGGA AACTAAAGTT 
GTTGCCAACG GTGTAGCTAG CCATATTGTT 
TATCAATTTG ATTTTTCTTT TGATCAAATC 
AAAGATCAAA CTTATCAA.GC " AATGGTGACT 



7 A 7 7_ 7, 7' ! HGC FYYTTEKSY? 

■ y et'-tiy ■■:::c:: yclagntfai 

7" AT 77YY- -7 7'::: FFPTXVELKY 

, ;. F7: Y A7YY TA7FC .5 QAYCKW 



aaa .. .a t:aaag:gaag taaatggaaa 
7 7Y.,7;y7' y cyata::yaat acagacaact 
. aa.a, . . ^ a- a aaa. aaa tyattcatta 

1AAA 7.7..' -A AYAAAEYACA TG AT jAAGAA 
:Y."7Y.A': . 7. A- CTAATATTAC AAGTCAAACG 

; '7.7:7 ' ay atyyaayyat taatgaagag 
CTAIACTAT: atagttggga aggcaattcc 
agt7ya.-'.:z"ag tg a 77 a 7 catc tacagtggct 
aaa a 7 : ycya^' vccga catgtcaacg 
ta a ■ a attatacttc tytaagctta 
tac7a77a7y at: yaaytatttc cgacgcatca 

YTIAAYAA' A- AAAAAATGTC TGCACAACAA 
/Yl TAG 7' AA AA GTCAAAAACT AATCGTTCCT 
aat:.aayctg GGCTATTGAA TTACTTTAAA 
AAT. A 7YYA TGACAYAACA TTTTGTGGAT 

gaayyyaaac aaggaaa:;ca aacacttatt 

GAT TTT" C CAAYTA77CTA T3AAATTGAC 

ay a a a 7. . .7 a, 77 y aaa 7 ata a aaa ttt ag aa 
tat. - a; a7a a tgat .attt aactgttgtc 
tttgag aa tg tcaatggt3t tgaaattgca 
yy. -aa -.a a . "y aay 7?g ctt tggctat 
agccttaaag gyaaaaatta tttatatcaa 
aa aaa. aaa aaaygaagag acccataaac 
7a a ' 7- " ~t a y a a g a aaa t aaatgcaaaa 
gtc aaa" 7; 7 ata cyccttatat a < a aaatttgg 
yagaaaaaaa aaataaactt aaagccagca 
a t c c aa g 7; a: a cg a tty g tgt tggatc tg aa 
aattgyaaag ttggcattac tttaaatacg 
atgatgacaa caattgctac aggtgaacca 
aa tg g aaa tt tttctgctgt taacgcagct 
7aaaataaaa7g cacaaaatga ggaaggtttt 
gcagaaatt atagaaacac gaagcaacat 
aatgga aaag aaaaaa a aata tttacgtttg 
azaga at'tat a7aa7attga aaaaaagagtg 
ttagcaacaa ccaatgtttc aggttttatt 
— -"a a a ac a -vccgc tattcaatta 
ataaayaayc aataaaacga aagtgatgtt 
aaa tt 7: 7' a 7 a tz 77 c a a c aaa tcaagg taga 
TGGAATTTAG tgacaagccc ataa 



EF055-2 ( SEQ ID NO:362) 

MKRSKWKE LIVTGICHIL VFPILICTTV FAET7 .PET-' A Z'RFGTAHAAC 

AEKAESEQPQ TKDKL.HDEET LALSKSELIC IZEA7AAA7T: EEEIFTFMLT YEYGFINEEG 

QPVNANEILL QYHSWQGNSP DGINVYJEGES A ZAA A. 7 7Y: EPF/A 7 7 PEEK VAVYSDMSTV 
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TABLE 1. Nucleotide and Amino Acid Sc-qeueivc- r'.~ E. jcca:!:-: Genes. 

LAASNQTFFL PRYYTSLSLY NKKGEIDPNY PLATIAAASG NQYPTTISQF ELEKMSAQQY 

SQKTGVTFNI SESQKLIVPL YNQVKVDSSN CISALAIA'AAF SGPVY V HVTN RKVTEHFVDT 

QGKPIPPPPG FRQGKQTLIE RDPYTFKQKD LA A YE : AA KTA ?F /GIAYK GKTKPENLEK 

SVTPSYDITY DDNDDLTWY KEIPQKNYTF EAYAA""L: A? PSDF1 ITKOQ PITTDGFRYL 

AGKKLPQQYS VNGKTYLYQG WYQDKTKQEA LEH1 ?T- A I I ] A PYFNF.iAKAIT AVYKEITAKA 

EMQIEGLVKV MPSGYIQIWQ IMLTNVGEVP LKKIALAAAS GWSPGLAPP1 QVTIRVGSEP 

NKTVPITDEN WRVGITLNTE VPIGQTATIM MTTI ATGFPD ^YLOAAVEI-TN GNFSAVHAAD 

TVRIQPKNQE IVAPDEEGFI STPTFDFGKV AIASNTQCHG LKQ^-DYYEN GOENPYLRLK 

KSQPNWALTA ELSPFEGRVD QLSSMTKLLL CTTl-A-C A AA YHC ATETKVA LGKTTAIQLV 

ANGVASHIVA NGQFDESDVY QFDFSFDQIK LEIAAIAAARA IX-TAC'AI-IYTW NLVTGP 

EF095-3 (SEQ ID NO:363) 



AAGTACAAAA 


CAAGTAAGAG 


AAGGAACCAA 


TCAA" 




CC, . 










ACAGCAGAAA 


AAGCCGAAAG 


TGAACAACCA 


CAGA: 


'AA 






AA, 


ITACA 


TGATGAAGAA 


ACACTGGCAT 


TGTCAAAAAG 


TGAGTTAATC 


gat/-. 


/IT- 


i 




AT-' 


: TTAC 


AAGTCAAACG 


ATTAGAGAAA 


GAATTGAGAC 


GCCTAACCTA 


act:, 






ATG 


J A 


TTTAT 


TiAATGAAGAG 


GGGCAGCCAG 


TAAACGCCAA 


TGAGATCCTT 


CTAA, 


- G T 


rp 


ATA 


JT r 


AGCCA 


AGGCAATTCC 


CCAGATGGCA 


TAAATGTGTG 


GGAAGGTGAA 


AGTC, 




GAG 


TGA 


! A' 


/CATC 


TACAGTGGCT 


AATTTAAAAG 


AAGTGGTAAT 


TCCAAGTGAG 


AAAG' 


"AG 


AC A 




ATTCCGA 


CATGTCAACG 


GTGCTTGCAG 


CGAGTAATCA 


AACATTTTTT 










•u . 




TTTAAGCTTA 


TACAATAAGA 


AAGGGGAAAT 


TGATCCCAAT 


TAT'": 




Tg ": 


AAA 


.IT 


ATTTC 


CGACGCATCA 


GGAAACCAAT 


ATCCAACAAC 


AATTTCGCAA 




lAT 


t :• .:; 






ATGTC 


TGCACAACAA 


TATAGTCAGA 


AAACAGGAGT 


AACGTTTAAC 


ATT.'v 




A 3 A 


gt :.: 




\AACT 


AATCGTTCCT 


TTGTACAACC 


AAGTGAAGGT 


TGATTCATCG 


AAAT 


•A,T 


at:; 


G A 




ITGAA 


TTACTTTAAA 


TTTTCAGGGC 


CGGTTTATTA 


TCATGTTACC 


AAAT I 






T A A 




A AAA A 


TTTTGTGGAT 


ACTCAAGGGA 


AACCAATCCC 


TC CAC C AC CG 












\aaca 


AACACTTATT 


GAGCGTGACC 


CTTACACCTT 
















TGAAATTGAC 


TCAAAAACGT 


ATCAATTTCA 


AGGATGGTAT 


AAA .I 






AG A 


aa 


a.ttga 


AAA TTT AG AA 


AAAAGCGTAA 


CGCCCAGTTA 


TGATATTACC 






: - A A 


ATG 


AT 


A A TTT 


AACTGTTGTC 


TATAAGGAGA 


TACCTCAAAA 


AAATTATACA 


TTTG 






tcaat 


GGT jT 


TGAAATTGCA 


CCACCATCTG 


ATTTTATTCA 


GGATCACCAA 


IA*A I 




TAA 


cta 


IG 


A AT jG 


CTTTCGCTAT 


TTAGCTGGAA 


AAAAACTGCC 


ACAACAATAC 


AGC j 






f ATA. 




A I TTA 


TTTATATCAA 


GGTTGGTATC 


AAGATAAAAC 


NAAACAAGAG 


AGCT 




AAV A 


>AAA 




A AG IG 


ACCCATAAAC 


TCCCCTGTTT 


TTAATGAAAT 


GAACGCTATT 


AC AG 






ATA 




GAAAT 


AACTGCAAAA 


GCTGAAATGC 


AAATAGAAGG 


ACTAGTCAAA 


GTCA 










TATAT 


ACAAATTTGG 


CAGATTATGC 


TTACAAATGT 


GGGAGAAGTA 


CCGT 


FAA- 


i"..-,-f-i 








AAAGCCAGCA 


AGTGGTTGGT 


CACCAGGTCT 


AGCTCGGCCA 


ATA A 










CGTGT 


TGGATCTGAA 


CCAAACAAAA 


TTGTTCCTAT 


TACTGATGAA 


AA.TT 






TTA 




ATT AC 


TTTAAATACG 


GAAGTGCCTA 


TTGGTCAGAC 


AGCAACTATT 


ATGA 










A AT AC 


AGGTGAACCA 


GATCAAGTGT 


TACAAGCGGC 


TGTTGAAATG 


AATG 




A; A T 


TTT 




ACTGT 


TCACGCAGCT 


GATACTGTCA 


GAATCCAACC 


TAAAAATCAA 


GAAA. 


r:\: 








A AT 2 A 


GGAAGGTTTT 


ATCAGCACAC 


CAACTTTTGA 


TTTTGGCAAA 


GTA A 






:ta 




/■.A I AC 


GCAGCAACAT 


GGTTTAAAGC 


AGGCAGCAGA 


TTATTATGAA 


AATG 


j> .i 




AA A 




A I ATA 


TTTACGTTTG 


AAAAAATCAC 


AACCCAATTG 


GGCACTAACT 


GCAG 










TTT AA 


AGGAAGAGTG 


GATCAACTAT 


CATCAATGAC 


AAAGTTATTG 


TTAA 










ATTTC 


AGGTTTTATT 


CAGTACAATC 


AACCAACGGA 


AACTAAAGTT 


GCTC 






AAA 




A I A AC 


TATTCAATTA 


GTTGCCAACG 


GTGTAGCTAG 


CCATATTGTT 


GACA 






AG TTT 


AAA A A 


AAGTGATGTT 


TATCAATTTG 


ATTTTTCTTT 


TGATCAAATC 


AAA." 


taa 


-if-. 


TTC 




AC AAA 


TIAAGGTAGA 


AAAG ATC AAA 


CTTATCAAGC 


AATGGTGACT 


TGGA 










AG A AC 


A 



EF095-4 (SEQ ID NO:364) 
STKQ VREGTNHSLT 



WO 98 50554 j>n 7US98/08959 



T ABLE 1. Nucleotide and Amino Acid Slv^-.-j: ; ■ .7 . . 7:,r.cs. 



AEr^.^SE^ F7 










TFEE7ANEIL7 






... . ... v 


EEAE' ESDMSTV 


LAASI-iQTFFL. 


P FEl" Y 7 £ E S 7 Y 




. . . . _ , , , . , „ _ 


E G 7 EGM S AQ Q Y 


aggaggvtfi;: 


SES^P'.LIVrE 






T TEKFVDT 


7GFFIPFFFG 




AGPYGFKEXG EG.. ' " 




GKYFPEaLEK 


GE'GTS YD I TV 








EE77DGFRYE 










EEEE7GE ITAKA 


EM7IEGLEEKY 


KFSGYIQIW : 






;v7:f.vgsep 




VVRYG I TLEE7E 






gaasayhaad 


T7RIQPKNQE 


IYAFDEEGFI 


ST?7?CFG V V - " 




GIENPYLRLK 


KSG PNWALTA 


ELSF FEGRYD 


E'Lssktkglg : : ■ 




GC7E7TAIQLV 




NGQFCE£EA r i' 


^'FGF ~FD~ !?■ : ;,: 




agytgp 



EFEJ96-I (SEC " NO: 3 55) 



7GAGGTGGCC AAGTTAAAAT GAJ^AAAATTA G 

ATTGTCATCT TGT7TTTTGG EGTGCGC CAA 7 

GATACCTTGA CCATTTACAA TTGGGGGGA : : 

GAAAAAGAAA CAGGCTATAA AGTCAATTAG G 

A C AAAAATT C AGCAA^GTGG CACAGCCTA7 G 

CAAAAAATGA TGAAA :;CGAA GATGE7TTTA : 

GAAAAGAT7G ATGCACGCTT TTTAGATCAA. 7 

CCGTACTTCT GGGGCACGTT GG3GATTATG E 

ATCCAACATT GGGATGATTT ATGGCGCCCG G 

GGGGGTCGCG AAGTGTTAGG ATTAT7TTTE A 

AAGGACCAAC AATTACGTGA GGGTAGGGAT A 

GCAATTGTTG C G G A TGAAA T CAAAATGTAC A 

ACTTTCTCTG GTGAAGCTGC TGAAATGC TA G 

CCCAGTGAAG GETCTAATCT CTGGTTTGAT A 

AAAGAGGGTG CCTATGCATT TATGAACTTT A 

GGAGAATATA TTGGTTATTC CAGA 7CAAAT A 

GTTGCCGAAG ATAAAGAATT TTATC ZAGAT G 

CAAGA7TTAG GTCAAGAATA CTTAGGAATT T 

TATCGGAAAT AA 

EF096-2 {SEC ID NO: 366) 

MKKLQ SLFIGIIAII VILFFGVRQL EKASGMA 
KETGYKVNYE TFDSNEAMYT KIQQGGTAYZ IA 
NIDARFLDQS FDPKNKFSVP YFWGTLG 1 1 Y :EE 
AREVLGLSLN SLGYSLNSKN DQQLRQATDK EE 
FSGEAAEMLE NNEKLHYVIP SEGSNLWFDN" IE 
EYIGYSTPNK EAKKL.LPKEV AEDKQFYPDZ) EE 
RK 

EF096-3 ( SEQ ID HO:367) 
AAGTGGCAT GGCAGGAGCA 

GATACCTTGA CCATTTACAA TTGGGGGGA C 7 
GAAAAAGAAA CAGGCTATAA AGTCAATTAC G 
ACAAAAATTC AGCAAGGTGG CACAGCCTAT G 
CAAAAAATGA TGAAAGCGAA GATG7TT77A C 
GAAAACATTG ATGCACGCTT TTTAGATCAA 7 
CCGTACTTCT GGGGCACGTT GGG3ATTA7T 7 



: 7 A. E 7 TA7GGCTATT 

; : EE7 GGCAGGAGCA 

"~7GA7 TAAGAAATTT 

-.AEG A AGG7ATGTAT 

— 7 E-A ATA7ATGATT 

,AA7T AAAAGGCTTA 

GEE '-A GTTT7CCGTT 

EG 7' EA EGG G GGTCAG 

e gaegattgat 

EE E7 AAA GAG T AAA 
■AGEAA ZAATGTCAAA 
vGTGC A ETTGCTGTA 
"TA EA TTATGTGATT 
AGGGAAAAAT 
,A7 GG GGCACAAAAT 
AEEEE7 A CAAAAGAA 
A77 77 A 1 E AA G TTT A C 
GEG EA GTTTAAGATG 



G EE. . EEY:E. 'EG ' 1 E ?AL :?'.KFE 
• "A ■" G ; E rE EGG' .E1EE? GGHSKLKGLE 

ag . " • ; : C:Eegege;g?e z?7j:cvmlidg 

7E GEE E E EEA rEEEEEGKEEEX ATCE ES AVAVT 
EE-A7EGGEX ECAYAFI !!CFM I APENAAQNA 
A GG. AA EI 'AA EGG I Y XZGFEEFKMY 



GG CEAE TEG AT TAAGAAATTT 

GG A777EGGEG , GA AG ETATGTAT 

EA EG E A EGG GA ATATATGATT 

7 G A G A. G E EG-, ATT AAAAGGC TTA 

77 CCEAAAGE7AA GT7TTCCGTT 

EA AA77EA7~GA CGG G GGTCAG 



WO 98/50554 
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TABLE 1. Nucleotide and Amino Acid Seqcv.c:uv> L. UjccjI:^ ' jcucs. 



ATCCAACATT GGGATGATTT 
GGCGCTCGCG AAGTGTTAGG 
AACGACCAAC AATTACGTCA 
GCAATTGTTG CCGATGAAAT 
ACTTTCTCTG GTGAAGCTGC 
CCCAGTGAAG GCTCTAATCT 
AAAGAGGGTG CCTATGCATT 
GCAGAATATA TTGGTTATTC 
GTTGCCGAAG ATAAACAATT 
CAAGACTTAG GTC AAGAATA 
TATCGGAAA 



ATGGCGCCCG GAA" ./.-.- "-.A 
ATTATCTTTG AA /. AT A A 
GGCTACCGAT AAC-T TA A ' A 
CAAAATGTAC AT . AA ". ,AA' - 
TGAAATGCTA GAAAAAA.A A- 
CTGGTTTGAT AAAAA'T' : A 
TATGAACTTT ATG AT ACTA .' 
CACACCAAAT AAA A. A AAA'TA. 
TTATCCAGAT GATG.TTA ATA 
CTTAGGAATT TATA- A AG ' 



ATAATGTCAT GCTGATTGAT 

gatatacatt aaacagtaaa 
g att a.ac gaa caatgtcaaa 
aagaaagtgc agttgctgta 
aacatata aa ttatgtgatt 
tgcctaagac agccaaaaat 
cagaaaatgc ggcacaaaat 
aaarj 1 . c tatt accaaaagaa 
taaataaattt agaagtttac 
tattattt ;a gtttaagatg 



EF096-4 (SEQ ID NO:368) 



SGMAGAD TLTIYNWGDY IDPALIKKFE 

KETGYKVNYE TFDSNEAMYT KIQQGGTAYD 

HIDARFLDQS FDPKNKFSVP YFWGTLGIIY 

AREVLGLSLN SLGYSLNSKN DQQLRQATDF 

FSGEAAEMLE MNEHLHYVIP SEGSNLWFDN 

EYIGYSTPNK EAKKLLPKEV AEDKQFYPDD 
RK 



ia:aa-a::/ kta:kaai:aap ldhsklkgle 

nt .a . _ c aaaa_atae lknnvmlidg 

LKTJTTT—AA T 'ATEAKXYM AKEESAVAVT 

I AAA A AT- AAA fgataaiiafm LAPENAAQNA 

ettaa.t a- :aaaa-:; iy nalflefkmy 



EF097-1 (SEQ ID NO:369) 



tagaagtatt ctaattatct 
atgcattcgc tcttttttaa 
ggtcatcgtt tgagtgggat 
ttgtctttgg tggctggcta 
acgataatga ttcgagttgt 
gaggaacaac gtggcggcgt 
gatgttccac agttgtttgg 
aaaattgaac aaattctctt 
tttttagcag gaattgtggg 
gctgttgaaa gcgctagttt 
cttttaccat tggttcacgt 
attaaccatg gcttattaac 
attttatttc tattggaaac 
ctgtttgggc ctgtaggaca 
gggggcattc atgaaattta 
gtaattgctg gaggaatgag 
gctccagctt cgccaggttc 
ctggcggttt ttagcggaat 
ttattaaaac gtcaacgagg 
caagtggaaa cagtcacacc 
ggctcaagtg ccatgggggc 
atgcctgtga cttaccagtc 
attcaagcag aattgaaaca 
gttcaaaatt ttttagaaat 
tcttctcaag agcaatcttc 
atacagaagc ttgttttttt 
gaattattgc ggcaacaagc 
ctggaaacag tcttttttac 
gcctatcatt tagatctaac 
aaagagtatc aagaatggct 



ACATAGAGAG CCA AAA 
ACATAAGTTT GA AAA A. . A A 
GATTATGCCA AAA/AAA, . A,, 
TACGACTGGG AAA AT. A": 
TTTAC CGATT C TA7ATTAA T T 
TGTTGCTGCT ATTAAA.AAA ; 
TGCTATGTTT ATTAGCAATT 
ACCGAAAGTT AAAAAAAAAT 
AGGACTGCTG TGC.IAT^A'T; 
TTGGCTGTAT CAA ITTTAAT 
TTTC TTAGAG C C C T'AA-' , A. A A 
GCCTCTAGGT TTA.AAA AAA A 
AAACCCTGGA CCA AAA A ■ - 
ACGAAAAACA GCAGAA ATI A 
TTTTCCGTTT GTTTTAAAG A 
TGGTACGCTT GTTA'A AA AT. A 
ATTGGTTGCG ATTTTAATT A 
TTTTGTTAG C TTT C 'I G A A A A 
AATTGAACCA GA'A'C .A- : 
TAACTATCAG C AAA AT A A A AT 
TAGTTTGCTA AG AA, A AAAT 
CGTTCATCAG ATA7-TGAAC : 
GTTAGCACAA AAGTAC -A;; - 
TAAATCCTAT TA : A A A A AA A 
ACTTGGTTCA GAATAT/.A:A 
ATATGCCGAG AATATT ":A : A 
GGCGAAACAA GG A AAA A 
CAAGGAGACA AC AAA. A A' : AA 
GCAACAAAAT TTATAAAAAA 
GGAAGGAGGA GC AAAAA .AAA 1 



G '• AAA .' AA : AA G AAAAAAGAA 
ATA" A atattt AAGTCGTTTT 

ttttaatt gtggagctta 
tagatatttc t aaagtcgaa 
ttacaggagg aaaaatgttc 

TGGG AGT AAT TGTTTCCACA 
TAGCAGGATA TACTTTCGCC 
AAAAAATGCT GACTAAAAAC 
G TATT AT A 2T TGTAGCTCCG 
CTTGATTATAT T^AAGCCAAT 
A ATTATTTTT TAATAATGCG 
CTAGTCAAAC AGGTCAGTCC 
GA3TTTTGGT TGCTTTTCTG 
C AA A AAT GAT TCAACTGATT 
AAAAGAACTT ATTTTTAGCA 
TA TT T A AT G T GGG TCTAAGT 
ATG GOT AG AC TGATGCGAGG 
CTTTTA AAAT A ACAAGCTTG 
TAAAGA TAAA GGAGGAAGAC 
TTT TTT ATG A TGCAGGAATG 

taaaagatgt gauicttggag 

AGCCTAAGAC ATTAGTGGTC 
OAGAAAAGGA TATGGTGAGT 
TTTTAA A AAA ACTGACTGCT 
AAACGAAATC GACAAAACAA 
AAAT A A A A A AC AATGGGAATG 
TAAA.A AATC TAAAGAGCCA 
TAAAT TTGA A ATGGCGCAA 
TTA ATA ATTT TTTGAATAAG 



WO 98/50554 



I>CTTS98/08959 



1- - 

TABLE 1. Nucleotide and Amino Acid Seqei..- 



:ua\ 



M17F.NF LAG I7 T GGELG CFGILYYAPA 

VXSASFWLYQ F7SWL:EAAA LFLAHAFAEr 

AFAAE7NPGP G'AGVIVAFIL FG?VG:F>GA 

1AGGMSGTG7 FQ 1 FNA'GLSA FAF?G:^LVA: 

LKRv KG I E?V SMIrG^EECQ YETY7P 77:'7 Q 

PV7YQSVHQK XWQPKTLVYI QAELK^LA^K 

SQEQGSAGSE ETE7NETKQI CXL7FL .YAE!-: 

E7ATF7KE77 YA^RELAQA YHLEL7QCNL 

EF0 97 -3 (SEC 77 NO : 371 ■■ 



-.S^TGQSI 

: FAIASLL 
EAAA'LEM 
'LAXLTAS 
! E'*£XEPL 



A ' 3 G A G G AA 7TG AA C 7 A G * - 7 G AA 7 G A - AA. A G ■. . 








C j-u-i G 7 GG AAA. 










AGGAATG 


GGG7 3AAG7G 


CCATGGGGGC 


7AG77TGG7A AG . . 




" A7G , G7-w 


.G7TGGAG 


A7GG3TGTGA 


C77A 3 G AG7C 


gg77:atgag a" , a: 






7AGTGGTC 


A77CAAGCAG 


AATT7AAACA 


GTTA j JAL rtj-. j-v^-v-; - ■-■ 






7GGTGAGT 


G 77 C AAAA 77 


TTTTAGAAAT 


7 AAA 7 7 G 7"A7 






7GACTGCT 


7G77 77CAAG 


AGCAA7C77G 


ACTT AG77GA GA77A7A. 






:aaaacaa 


A7AC AGAAGC 


T7GT77TTTT 


ATATGCCGAG AG-. 7 77 :" ' 






7g< ;gaatg 


GAA77ATTGG 


G3CAACAAGC 


G G C G AAAC AA G -7 . .A 






7AGAGCCA 


C7 3-GAAACAG 


rp rp tp rTTJW-p^ Q 


:aag:;agaca A7:;a:a 






7GGCGCAA 


GGG7ATCA77 


TAGA7G7AAC 


GCAA 3AAAA7 77 A 77 A 






7GAATAAG 


AAAGAGTATC 


AAGAA7GGC7 


GGAA3GAGGA G 7 77 77.-. 









EF097-4 (SEQ ID NO:372) 

RGIEPV SMIKMKEEDQ VETVTPNYQQ : LFVCGAGIA - - 7A7FA-.. A : AQLKATuLEM 

PVTYQSVHQM KWQPKTLVYI QAELKQLAQK 77A 7 A ' 7 A 7 A A. ;'. A' i FQ7LAKLTAS 

SQEQSSLGSE STETNSTKQ I 2KLVFLYAEN '7- 7 A A! A7 7:7 . , A -77 A : VAIE7SKEPL 
ETVFFTKETT YWTRELAQA YHLDLTQQN7 Y\7 r 7. F: AG?; E7;EAA77GA DR7F 

EF098-1 (SEQ ID NC:373) 

TAAATGAAAA AGACAAAAGT AA7GACA77G A7GG7A7A A G77A~777AGG GG 7 A 7TGGCA 

CTTGTACCAA TGAGTGCATT AGCAG7CGAG GG77777AA7 AC7AAAGAAA CGGAGCGATT 

CAA7TTGCAC CAAATACGAA CCCAAGGAA7 G V-..: 'A . ' "A GGA7GAA77G AGAC7CAGAT 

AAACCAATTA CACCAGTTGA TCCAAC7GA7 G .-A A : C7AAGGGAGG GAGAGCAGGT 

CCGTTATCCA TTGAC7ATGC ATCTAGCTTA 7 7777 7 77:7 AAG777.7GA7 7A3C7CAAAA 

AATATGACCT ACTA7GCAGA AACACAAAAA 7A7AAA • A ACGC7GG7G 7 GGACGAAGAA 

GGCCCAAAC7 TTGT7CAAGT CTCAGATAA7 GG7A77A77G AGA7A7G77G GAGG7TAAAA 

G7AAAACAAA ATGG7CAAT7 CAAAAGTGAA ^ 7G\; A A. • AAG777A7AG: GGGCAAAGTA 

ACT7TAAGCA ACGGACGCGT GGT77CAGC7 77AGAAG;:7G GA.AAGGG.AAG GAGAGCGCCA 

GCTACGATTG AATTAAACCC AACTGGGGC7 G7777 7 7 7 7 7G 77A7G777G7 7G7CGATAAA 

GAAGGTGCGG GTACGTACTT AATGAGCTGG GG7 77 7 77 A'G 7AGA7A7 7 77 7AAAACAAGT 

ATTTCATTAG AAGTACCTGG 77CAACGACA AAA 7"- 7. AAAA77A7AG GA7AACTT7T 

ACT7GGACT7 TGACAGATAC ACC7GGTAA7 A7A AAA.- 7 AA 

EF0?£-2 ( SEQ ID NO:374) 

MKKTKVKTLM ATTTLGALAL VPMEALAVCG G77A " '■ - . FAA77777:;? "7P7NPDPDK 
PITPVDPTDP TGPKPGTAGP LSILYAGSLG F. 7 . ' A' A 177777. 77' , KV KENAGADQEG 



WO 98/50554 



PCT/US98/08959 



TABLE 1. Nucleotide and Ammo Acid Seqa;c:!,.> : z.hica:fiy (iencs. 

PNFVQVSDNR GTETGWTLKV KQNGQFKTEA NTT'.; "' A ; A" 'T G'GGG ' " '."'AS QSAKPTTAPA 
TIELNPTGAE SWMAAGDKE GAGTYLMSW3 1.C7E G7 7 7 77: '.7 GACTF YAKKYTTTFT 
WTLTDTPANT GN 



EF098-3 (SEQ ID NO:375) 



AGTCGAC GGTGGTGAAT ACCAAACAAA CGGAGCGA77 








CAATTTGCAC 


CAAATACGAA 


CCCAACGAAT 




TG CGACG 


\ATCC 


AGACCCAGAT 


AAACCAATTA 


CACCAGTTGA 


TCCAACTGAT 




GC L. i /-.A 1 ,-- 


::::agg 


GACAGCAGGT 


CCGTTATCCA 


TTGACTATGC 


ATCTAGCTT A 






v:gat 


TACCTCAAAA 


AATATGACCT 


ACTATGCAGA 


AACACAAAAA 




ta acgct 


-:,ZTGC 


CGACCAAGAA 


GGCCCAAACT 


TTGTTCAAGT 


GTCAGATAAT 






v*;TTG 


GACGCTAAAA 


GTAAAACAAA 


ATGGTCAATT 


CAAAACTGAA 






\cagc: 


GGC CAAAGTA 


ACTTTAAGCA 


ACGGACGCGT 


GGTTTCAGCT 






• :;,ac 


GACAGCGCCA 


GCTACGATTG 


AATTAAACCC 


AACTGGGGCT 






:;:tgg 


TGGCGATAAA 


GAAGGTGCGG 


GTACGTACTT 


AATGAGCTGG 




7 .4 7 A Cm 


\ : ccc 


TAAAACAAGT 


ATTTCATTAG 


AAGTACCTGG 


TTCAACCACA 


A-----L a --GC-- ^ 


-sLt-'. 


- •'• •AC 


GACAACTTTT 


ACTTGGACTT 


TGACAGATAC 


ACCTGCTAAC 


a- -a --aaa 


CT 







EF098-4 (SEQ ID NO:376) 

VDG GEYQTNGAIQ FAPNTNPTNP VDPTNPDPDK 



PITPVDPTDP 


TGPKPGTAGP 


LSIDYASSLS 


:-••!•;.. 7 


ICG7N 


MTY 


7 A 


:~oky 


KDNAGADQEG 


PNFVQVSDNR 


GTETGWTLKV 


KQNGQFKTEA 


IK e: 


AAK7T 


L£N 




'/GAS 


QGAKPTTAPA 


TIELNPTGAE 


SWMAAGDKE 


GAGTYLMSWG 










;gttk 


YAKKYTTTFT 


WTLTDTPANT 


GN 
















EF099-1 (SEQ ID NO:377) 














TGATGTTGTA 


GAGGGCTGAT 


GAAATGTTTA 




■ — rp 

- X 






7TGAA 


AGGAGAGATC 


ATGAAGAAAT 


TAGGCAAGGT 


TTTAATTGTT 


A tTYG 


TCTTA 




TT 




TG CTTTTTTA 


TTATTTTTAG 


GTGTATTTTC 


TTCTAGTGAA 






CTT 




cagtt 


TCAGCCCGCT 


ACACCACAGG 


AAAAAGTAGC 


ATTAG AAG T T 


t rTAA 




TGA 




1CACA 


T3GCGGAACG 


TTGCAGTTTG 


CTTCCGCTTG 


GATTGGCAAT 






AAA 


GT 


3GATT 


AAATCCTGCT 


AGAATTCAAA 


GTGATTTATC 


GTTTAATTCA 


g : i;.'.: 




TTAAT 


gcttc 


GTTAGGCGGT 


TATGGAATTG 


GGTTAGGACA 


ATGGGATTCA 






TTA 




IT ATT 


AAATTTTGCA 


AAAAGTCAAA 


AAAAGGAATG 


GAAATCAGTA 


gcttc 


AC AAA 


TGG 


A i 


ITTGC 


GTGGAATAAG 


GATGGTTCTG 


ATAGTGACTT 


ACTTAAAAGA 


ATGTC 


77.. A AT 


CAA 


AAA 


7ATGT 


GAATACACTT 


GCGGTAGATA 


TTTTGAAGCT 


GTGGGAACGA 






A-AG 


A'T 


:7ATc: 


C C-C AG AACAA 


GTAAAAAGAA 


AGGCTAGTGC 


TAATAATTGG 


tac.ag 




TTT 


C i 


A CAGG 


TTCCATGGGC 


GGAGGTTCAG 


CCAATGTTGG 


TG G AG G AAA A 


attg; 


TGCCT 


TGA 




A AAGT 


GATGGGGCAA 


ACTATTAATG 


GTGGTCAATG 


TTATGGCTTA 


tctg: 




TTG 


T7 1 


GAAAA 


ACAAGGAGGT 


CTACAAATGA 


TGGGTACGGG 


GCATATGTTT 






TT7 


GI 


A77TGA 


TTATCCTTGG 


AGTTCAATTG 


GTTGGACAGT 


CATAAAGAAT 










-■~^TAA 


AGCAGGAGAT 


GTCATTAATT 


TTGGTCAAGG 


TGGTGTGGCT 


acta-: 




at: 




CAT AC 


TGGTGTAGTG 


GCAAGTGTTG 


AAGGTAAAAA 


CAAGTTTACT 


acta; 










ACAAGGTCAA 


ATTGTTGCTA 


AGTATTTTCG 


GACTTGGG3A 


ttac; 




ca: 




GTGAC 


CAGCATAGTA 


AGGAAATAG 



















EF099-2 (SEQ ID NO:378) 

MKCLS VFFLLKGEIM KKLGKVLIVS CFIFILP777 ]■ G7VFS CG EG GOSSQFQPAT 
PQEKVALEVS NYVTSHGGTL QFASAWIGNM EHEGGLGTAR 7;GG1 77SA IAFNPSLGGY 
GIGLGQWDSG RRVNLLNFAK SQKKEWKSVA L7M77AYG7-;D GGAG: 777RK SKSKDVNTLA 



WO 98 50554 

TABLE ]. Nucleotide and An:mo Ac: J Sjci 



PCT I S96/08959 



azitTAgdv 



ttg7agtttc 7TT7cgct7g gattggcaat a: aag a;.;. a -77777 aaat7ctgct 
agaattcaaa ctgatttaac gtttaattca ■ 7.7 t: a;a "aatc ^ttasgcggt 

ta7g7-aatt7- 3gttaggaca atgggattca 7 7 a : 7a7 77aa77 7a77 aaattttgca 

AAAAG TC AAA AAAAGGAA TG G AAAT7AAC 7 A 777 " A"-. A.-7. 7 .-GA7 7 77 GC 3 TG G AAT AAG 

GATG3TTCT3 ATA 77GACTT A7TTAAAA7A A" 7. " A .7 AAAAA A,73T AAATACACTT 

GAG 3 TAG ATA TTT 7GAAG 7T GTGGGAA7 GA 77A.7 ' A- - A AA7A7 AA ZZ 7GCAGAACAA 

GTAAAAAGAA A73:TAGTG7 TAATAATTGG " ' 7 .: 77 77AA AGG 7T77 ATGGGC 

GGAG 3TTCAG :CAA77TT7G TG 7AGGAAAA 7.77../ . 7 7 .-.ZZ Z'l 7ATGGGGCAA 

A CTATTAATG GT3 3T7AATG T7ATGGGT7A : " A - . 777 77 7 77 ..- A .-AAA ACAAGGAGGT 

7TACAAATGA TGG3TACG jG G7ATATG7A7 G7.AA '• : AAA TT3G7AA7GA TTAT 7CTTGG 

AGTT7AATTG GTTGGACA7T CATAAAG.-A7 7 7 AAA !' . A "77 7A7A7A777AA AG 7AGGAGAT 

GTCATTAATT TTG.77CA.sGG T33TGTGGCT A AAA . 7'" 7 ATGGG7A7AC TGGTGTAGTG 

GGAAGTGTTG AAG3TAAAAA 7AAGT7TA 77 A777/A A' A AAGG,C7 77GA ACAAGGTCAA 

ATT3TTGCTA AGTATTTT7G GAGTTGGGGA 77.-7.-- 777 7A7A77A77AC 7AGCATAGTA 
AGGAAAT 

EF099-4 (SEC ID N7:380} 
SES GDSSQFQPAT 

pqekvalevs NYVTSHGGTA ^fasaat 777-; zzz: ' : : : . : . • : 7:a aafxpsaggy 

GIGLGQWDSG RRVNLLNFAK SQKKEVTKGVA L.XA" AAA": A. A ; 77777 7KSXGVNTLA 

VDILKLVJFRA GTKDDPAEQV KR?CASA777'7 K77 . A A, ZZ ZZZ 7AA-K 7. GAAEKVMGQT 

INGGQCYGLS AFFVEP'QGGL QKM3TGHXFA 77: A: 7" '7 Z I A7X.A 717? 777SGIKAGDV 

INFG7XSGVAT SIYGHTGWA SVE3KNK777 77 v 77" AA : VA77777AGL 7FPHVTSIVR 
K 

EF100-1 (SEC ID 110:381) 

TANTTATGGC AATATGGAAG GAGTTTTATA A77,,7. •, ■ AAA A A : 2 A A A A ATA 7GCAGGGTTT 

ACATTATTAG AAA TG TTG A T TGTCTTATTG AT777777C7G TATTGATTTT ACTTTTTGTC 

CCTAACTTAG CGAAACATAA AGAAACAGTT GA7 .77.7 AAA- G77AA77A7GC AATCGTAAAA 

ATTGTAGAAT CACAAATCGA GCTCTACACA CTAGAA.AAAA AT AAG A 77 CC 777CTTAAAT 

GAATTAGTCA ACGAAGGCTA C ATTA7 T AAA GA7 AA ". ". 7.7 7 A7.AA/A7A7A7 A G 7 AG AAAAG 
CAATGA 

EF1C0-2 (SEC ID 130:382) 

MKKKQKYAGF TLLEMLIVLL I ISVLILLFV PAA/7 7A.E77' TGAGAGAITAK 

IVESCIELYT LEKNFTPSAN ELVNEGYI7X E; 7:7 7"ArK ; 

EF10Q-3 (SEC ID NO:383) 

TAA AGAAACAGTT GATAAAAAAG GCAATGAA7 7 /AA 77A7AA-AA 

ATTGTAGAAT CACAAATCGA GCTCTACACA CTA.A--A.AA A 1 "" A .A A A 7 77C TTCCTTAAAT 
GAATTAGTCA ACGAAGGCTA CATTACTAAA 7AA 17.7" 7.77 A7A7A77/AAC AG C AG AAAAG 



WO 98/50554 PCT/US98/08959 
TABLE 1. Nucleotide and Amino Acid Scqcr..-:. f . . Ua ; > Genes. 



CAAT 



EF100-4 ( SEQ ID NO:384) 



KETV DKKGNEAIVK 

IVESQIELYT LEKNKTPSLN ELVNEGYITK EvLIA 



EF100-1 (SEQ ID NO:385) 



TANTTATGGC AATATGGAAG GAGTTTTATA. 
ACATTATTAG AAATGTTGAT TGTCTTATTC 
CCTAACTTAG CGAAACATAA AGAAACAGT7 
ATTGTAGAAT CACAAATCGA GCTCTACACA 
GAATTAGTCA ACGAAGGCTA CATTACTAAA 
CAATGA 



at — .c-a aa^ aaa a a ta cgcagggttt 
;. : :-:v. a tattgatttt actttttgtc 
gataa,..,, j.a gcaatgaagc aa tc g t aaaa 
cta ; - a; a ataagacgcc ttccttaaat 
gag. a a: r;.G ATAAATATAC agcagaaaag 



EF100-2 (SEQ ID NO:386) 



MKKKQKYAGF TLLEMLIVLL IISVLILLF\ 7 A: ::r V DKKGKEA 1 AAK 

IVESQIELYT LEKNKTPSLN ELVNEGYITK EQLDK V~ AAA Q 



EF100-3 (SEQ ID NO:387) 



TAA AGAAACAGTT GATAAAAAAG GCAATGAAG A A_A : CGT AAA A 

ATTGTAGAAT CACAAATCGA GCTCTACACA C TAG A A/, AAA ATAAGACGCC TTCCTTAAAT 
GAATTAGTCA ACGAAGGCTA CATTACTAAA GAGC.AA': TAG AT/^T^.TAC AGCAGAAAAG 
CAAT 



EF100-4 (SEQ ID NO:388) 



KETV DKKGNEAIVK 

IVESQIELYT LEKNKTPSLN ELVNEGYITK EA AAAVAAKK Q 



EF101-1 (SEQ ID NO:389) 

TGAGGAGATG AAACGAAGAA AATGAAGAAG 
GTAATTGCGG TTGGGGGCAT CGTAACTGTG 
GCTGTCAAGC AAGCGCCTAA AGATGACTGG 
CAACAAATTT ATATTAACGG TGTCATCCAA 
CAAAAAATAA CAAAGGATCC AGAGATTAAG 
ACAGAATTAT TTACTTATGA AGATGAGGCG 
AGCTTAGCCA AATTAGAAAC GAAGCGGGCG 
GATAAATTTA ATAAAACTAA AGAAGAAGA : 
CAATATCAAA CAGAAGTCGA TGCAGTAGAT 
GCGGATTTAG GAGCGAAGCA ATATATTTC : 
ATTCCAGAAG TAAAAGATGC C AATTC AC CG 
TTAGCTGGAA AAGTGAATGA AAAGGACTTG 
CTAACTTCTG TTTCCAACAA TGTGGTTGT 3 
CCTCCTGAAG GCAACAGCGA TGCCGCGAGT 
AGTTATAGCG TCAAAATTGC GTTGGCCAAT 
CAAGCAACCA TTGATTTAGG CGATTTAGG 3 
AAAGAGGGTG AACAGGCCTA CGTTTTAGI G 
GTCCAAGTCG GGCAAGAAAA TGGCGACAAA 
GACCGAGTGG TTATTTCTTC AAAAAAACCA 



AAAA • :aa TTATATTGGG GGCAGTTGCG 

aatgaattaa ATAAAAATGC ACAACAAGTA 

GGAAAAAACT ATTTTGAC3T TCCCGACTTG 

CCGGAA. AAA TGGAAGCCTT TGCGCGTGAT 

GTGAAAAACG GCGATGTC 3T AGATGCAGGC 

GTAA A AAA AAATTGAGGC ACAACAAAAT 

AATAA " A r AA ATAAGTGGAA TCGGGCCATT 

c-:v ^ gat ctggtgatga tttaaatgaa 
gaaaaa-- ha ccttcaccaa tgaaacctta 
aiaaa. ;. cia atttaaaagg tcgtgtatca 
atttta: ggt taacttcaga agatctttat 
a::':,.;.- aa-, gtgttgggca aaaagctaaa 
gataa ••; :;a.a tttgttacat cgatgataat 

GGCAAA CAG AGG j CGGC AC AACGATGTCT 

ttagaaaaag TCAAAAATGG ctaccatatg 

g:ca7taagt tac:gaaaaa agcgattcaa 

aatgatitag gaa:gatcat tcgtcgtgat 

ATG~AAAT7A aatctggctt agaatcagcc 

GTA.-.A-. A ' GTGA7AT7GT TGAATCAGAT 



WO 98 50554 



PC T/VS98 U8959 



1- 

TABLE 1. Nucleotide and Amino Aud Seqe \.\ . 0.-ne> 



ef: gi- 



ii: : 3 9 C " 



mkkk 
qiyiw 

lakletxrah 
glgakqyi st 



iq? egkeafafg: 
iykkwnraii: 
kakfkgrvsi 



FEYKEANS?: 



7SVSNWVVD GSISYIDENP PEGNSEAASG V.l 
ATIDLGDLGA I ELPKKAIQK EG EQAYYLYN IY 
FWISSKKPY KVGEIVESEA AIA3DESA7:: F = 



:e:eaqqn£ 

G7/GGXAKL 

:gykmq 

:esglesad 



EF1C1-3 ( SEQ' NO: 



taaaaatgc a 

GCTGTCAAGG 
C AA C AAATTT 

caaaaaataa 

ACAGAATTAT 
AGCTTAGCCA 
GATAAATTTA 
CAATATGAAA 
GCGGATTTAG 
ATTCCAGAAG 
TTAGGTGGAA 
CTAACTTCTG 
CCTCCTGAAG 
AGTTATAGCG 
CAAGCAACCA 
AAAGAGGGTG 
GTCCAAGTCG 
GACCGAGTGG 
GCAGCGATTG 



.C AACAAGTA 
AA. G C G 3 C T AA 

atattaac 3g 
caaaggat:: 
ttacttat3a 
aattagaaa: 

AT AAAA C T AA 

ca gaagtc ga 
gagcgaagga 
taaaagatg : 

AAGTGAAT3A 
TTTCCAACAA 
GCAACAGCGA 
TCAAAATT3G 
TTGATTTA 3 G 
AACAGGCCTA 
G G C AA G AAA". 

ttatttctt: 

CTTCTGATGA 



T5TCATCCAA 
A j AGATTAAG 
AGATGAGGCG 

T3GAGTA GAT 

atatattt: : 

C.AATTCAG gg 
AAA3GACT7G 
TGTGGTTG7G 

tggggcgagt 

GTTGGCCAAT 
CGATTTAGGG 
CGTTTTAG7G 
T ^'^i C ^j^a — Aa.-i 
AAAAAAAC ^A, 

atcagcaa:: 



gg:. 



a: . : " -A " ":r tgecgacttg 
-a-- tgggggtgat 
ggga7gtggt agatgcaggc 
/g.g^ygaggc a:aagaaaat 
a7aag7ggaa tggggccatt 
ggggggagga tgtaaatgaa 
:;:tt:ac;:aa tgaaacctta 
AT7T3AAAGG tggtgtatca 
TAAGTTCAGA A3ATGTTTAT 
GG ' G 7 TGGGEA AAAAG C TAAA 

tt7c77agat c ga7gataat 
ag:-g:ggva: aj\ggatgtct 
tc a aaaa g 3 g c 7 a 3 3 atatg 

TAGG3AAAAA A G 3GATTGAA 

gaaggatcat tgggggtgat 
^aagtggett agaatcagcc 
G r :*.-ATAT r ;:;T tgaatcagat 
tgagagag g: gtcgaaat 



EF101-4 (SEQ ID NO:392) 



KNAQQVA VKQAPKDDWG IDYFDVPDLQ 

QIYINGVIQP EQMEAFARDQ KITKDPEIKV K:Y 

LAKLETKRAN IYNKWNRAID KFNKTKEEDR TIY 

DLGAKQYIST KAKFKGRVSI PEVKDANSPI LEI 

TSVSNNVWD GSISYIDDNP PEGNSDAASG IGP; 

ATIDLGDLGA IELPKKAIQK EGEQAYVLYN DE 

RWISSKKPV KVGDIVESDA AIASDESATN E£: 



ELFTYEDEAV TKEIEAQQNS 
YQ^EYDAVDE EITETNETLA 

ag?:v:;ekelt kisvgqkakl 
ysv?::ala::l dxvxngyhmq 
C'gg :;-e : :g : : ;gm a : e s gles ad 



EF102-1 (SEQ ID NO:393 
TAAACATTTG AGACATTCAG 
ATTAATTATA TGAAAAAAAC 
TTAAGTCAAA CAATTGGCGG 
CAGCCTCAAG AGGTAACAAT 
TTTAGCGATG GCAGCACGCT 
GCAAAACAAA CAGTATTTTG 
GGTTATCAGA AAAACCCTTT 
TGGGAAAAGG CTGGAACAGA 
GAAGTGAACG GTTATAAACT 



) 

AGGTGAATGT CTGG 
AACATTTAAA AA77 
AACGATTG3T CCTG 
TCATTATGAG GT/J; 
CTCAGAACGT AGTG 
TATTGAACGA GGT3 
GCCATCAATG TCTG 
TATTGATA3A AATG 
CCATTGCATA A.-,,. 



ttaetcaaaa acgaaagggg 
tatttgcgac tttg3ctcta 
c7tttg0g3a tgaaattact 
tgtatgaagt tga gggaagt 
ta7a7gcaga atacaatggt 
ttccaacaga agtgacgcac 
cgaaactagt atcg3ttctt 
3 a 3 aaaja gat gatttgggaa 
gggg7gg77c ag77gatata 



WO 98/50554 



PCT7US98/08959 



l'v'- 

TABLE 1. Nucleotide and Amino Acid Seqeuenc : jua nils denes. 



AAATCTATTG 


AAGGAAAAAT 


TAATAAGGCA 


attga - 




/ \ -L < 




-.A AAA 


AGCAAGTTTT 


CATAATACCA 


CTGTAAAAAC 


AATTTTAGGT 


ca agg - 


g::aa 




GTAGA 


TAAAAATGAA 


TTAAATTTAT 


CTGAGTTTGA 


TAAAGTCGTC 


CAAAA. - 




GG.- 


lA - r.T.'-'JjA 


TTACCGTGTA 


ATTGGGAATC 


AATTAGTGCT 


TACTCCAAAC 


TGTAGG 


g: ga 


aatgaggaac 


ATTAACATTG 


AAAAAATCAG 


CTGGTACTGG 


AACTCCAGTC 


ggtta; 


■' . -TAA 


AAC 


j; ./I'jb.'i'G 


t:aaactgtg 


ATGGCTGGTG 


CGCTTGATAA 


GCCCAATACC 


TAGGG" 




AJG 


ATT, 


VATGT 


GGAAACTAAG 


GGTTCTTTAA 


AGATCAAAAA 


AATCGATAAA 


GAATCA 


;. "^7^ 




~;ta:c 


AGAAACGGTT 


TTCCATTTAG 


ATTTTGGGAA 


AGCTTTACCT 


TGA.AAG- 


: , r ; . : 


TAT 


V AAA C AG A 


TAAAGATGGG 


ATTTCTATTT 


TGGATGGAAT 


TCCCCATGGT 


a :aaa ■ 


ga;, 


:t. 


ATT. 


*CT 2A 


AAAATCGGTG 


CCAGATCCTT 


ATATGATTGA 


TACCACACCC 


ATGGG' 






■-T 


i..A„AGC 


GGGCGAGACC 


ATTTCCATGA 


CTTCGAAAAA 


TATGCGACAA 


AAA A 




TT 


":TT 


ggaga 


GAAGACTGGG 


GTAGAAACAG 


GTACTGATCT 


TTGGAATGAC 


AATTA" 


. . -. , r 


I 1 A' 




ttaaa 


TACATTTGCC 


ATTCGTAAAG 


ACAGCCCAGC 


TGGTGAAATT 


c; ;• •;, 




TA. 


A 2 A 


aggja 


T 3AAAAAGGT 


CGTGCGGAAA 


CACCAAAAGA 


GCTTGCTAAT 


g:tti 




TG: 




v:gta 


TTACGTGACA 


GAAACTAAAT 


C TAG T AATG G 


TTTCGTGAAT 


a:gt: 








^■GT 


CGAGTTAAAA 


TATGCCAATC 


AAACCGTGGC 


TCTTGTTACC 


AGTAA . 


■■ -n - 


AA< 




:aa,aa 


C GAAGAAATT 


ACTGGGGAAA 


CCACTTTGAC 


AAAAGAAGAC 


A_rwv-.~ - 


; ■ r: 


3T. 




JAG AG 


T : AAGGGAAA 


GCTGAGTTTA 


AAGGAGCTGA 


ATATACTCTC 


ttta • 


■:;AA 


AA< 


aaaa 


tgt:a 


AGCTGTTAAA 


TGGAGTGAAG 


CTTTTAAAAC 


AGAATTAGTG 


AAGG-- 


;-. gga 






^TGA 


TGAAACAGTG 


ACTTTGGCTT 


TAGATGAAAA 


GAACCAAGTT 




■GAGG" 




GTA 


J TAAT 


TAACGAGTAT 


TTCTGGCAAG 


AAACCAAAGC 


ACCTGAAGGA 


TATA G' 


' GTG 


AT- 


~; A A 


a.::aaa 


GTATCCTGTA 


TCCATCAAAA 


AAGTTGATAA 


TAACGAAAAA 


aatj: 


" GA A 






GGA 2A 


T3TTACGGCA 


AAAGAACAAG 


TTATTCGCTT 


TGGCTTTGAT 


ttg a: 








:.-:ATC 


GGCTGATGGC 


ACTGCCGAAA 


CTGGATTTAA 


CGACTTATCT 


T7TAA. 


7G .I 1 






at: J A 


AGGGACCAAN 


GAAATCACAG 


GTGCTGAAGA 


TAAAGCGACC 


agajg 


G . . 




jag; 


:aatt 


AGGTTTTGAT 


GGCTATGGTA 


AGTTTGAAAA 


TCTTCCTTAT 


G AG TA~ 


-. : ... .. 






AAA. 3 A 


AATAGAGGCT 


CCAGAAGGAT 


TTCAAAAGAT 


TACACCACTA 


GAAA.T 


JTT 


i"'~p 




GGLTAA 


G3AAAACAAA 


GACGACTATG 


CGAAGAGTGA 


GTATGTCTTT 


a g :a: .. 


agg 




AAA 


; aa :a 


AAAACAACCA 


ATTAAGATGG 


TGACCGTTCC 


TTACGAGAAA 


CT/Vi. J 




*\ - 1 


tag 


TTTTC 


TGTTAGTCTG 


AACCGTTTGA 


TGCTTTATGA 


TTTGCCCGAG 


A AA GAG 


■'■ATA. 






A GTTC 


TCTTGCGACT 


TGGAAAGACG 


GAAATAAAAA 


ATTGAATACC 


C TTG AT 






jA J 


JTA3T 


TGATAAATTG 


AGATATAACT 


TGCATGAAAT 


CAAAGAAGAC 


T jG TAG 


GTGG 




2-2'V 


7AA3C 


CATTGATGTG 


GAAGCCACAA 


AAGCTGCCCA 


AGAAAAAGAC 


gaaaa; 


■"TGA 






!3TG 3T 


GATTGCCGAA 


ACAACCGCAA 


CGTTGGCGAA 


CAAAGAGAAA 








-iAAATTTT 


GGATAAATTA 


ACCGCTGAAC 


AAGTTTTGGA 


TAAAAGCATC 


GTGTT 




ATTAT 


3 TATA 


TGAAAACAAG 


GTAGCCTTTG 


AAGCAGGCAA 


TGAGCCAGTA 


gg j;g 








TT3AA 


CAATCAAGCA 


CAAACCGTCA 


ATTGTACGAT 


TGAACGCCAT 


gtgtg 










CGACCTAGAA 


GATGGTTCGC 


AAACTTTTAC 


TCATGGTGAC 


GTGAT : 






TTT 


GA A GA 


TGTGTCGGTT 


ACCCATGATG 


TACTGGATGG 


CTCAAAAGAA 


GCTTT' 






ATT 


gtgta 


T3CTTTACTA 


CCAGATGGTA 


CGAACAAAGA 


AATTTGGAAA 






'T'V 


.TAG 


;:atga 


AGTGAATGAT 


AAAGAATTTA 


CCAAAACCGT 


ACTTGCGGAA 


aaagt; 








AAGTA 


TCCAGAAGGA 


ACTAAGTTTA 


CTTTTACGGA 


AATCAATTAC 


G A/ \„' i 






A AG 


31 AAA 


TGGAAAACAC 


AATGAAGATT 


TGAAAGAAAA 


ATCTCAAACC 


TTmmL, 






GAA 


3T3GC 


AACCATACCG 


AGTACGCCAA 


AACAACCGGA 


AACACCAGCT 


gttgc; 




AT 


TGTGAAGA 


ATCTAGTCCC 


ACAGTGAAGA 


CATTCCCGCA 


AACTGGGGAG 


aaaaag 






GTT 


GTAGT 


GTTAGTTGGC 


TTTATCTTGA 


TTTTTTCGAC 


TGCTGGGTAT 


tattt: 




AT 


CG" 


gggaa 


TTAA 



EF102-2 <SEQ ID NO-.3 94) 

MKKTTFKN WSLFATLALL SQTIGGTIGP TIAFAEEGG-! 

PQEVTIHYDV SKLYEVDGTF SDGSTLSERT TST GAL ."GGA KQTVFCIEPG VSIPTEVTHG 
YQKNPLPSMS DKAKLVSVLW EKAGTDIDTN MVA GGGG '.. GGG VWGYKLHGTK RLGGASVDIK 
SIEGKINKAI EEYQKKPSFH NTTVKTILGQ GTTG: GGG NLSEFDKWQ NTANIDYRVI 
GNQLVLTPNS NSKSGTLTLK KSAGTGTPVA YKGAG. , '." AGALGHPIGGY A I K I TTVETKG 



WO 98 50554 



>CT'l 1 S98/08959 



TABLE 1. Nucleotide and Amino Aci^ 



gpyp::::tt?m aa/::k 
Avr:^;A:.v~s ntvkgc 



-.e:=xa7t 

(SEYVFT 



DVAK 

KDGNKF.LNTL DFTELYDKLR 
TAT 

tvk 
dgt:; 

E 



:gk: 



PKELAI 



gtfasletvt 
av:tf.d' t tak 
acneqlgfeg 
iteeg^kqpi 



I ERHV 



GTWKILHKLT AE C V I , DF £ I \ 

sic; tka hIjED g s c t ftp: g g*. 
feiwks g>;:ekeyi:dk eftktvlaf? 
eksqtl tpkevptifs tfkcpetpa' 
stagyy fwnrrn 



:■ FPTKVELKY 

a_kdgqavkw 

l.DETKYPVS 
>' VGPGEGTXE 
I RETFKENKD 
FGF1TSLATW 
FAivP'rv'IAET 
FDAS1NNQAQ 
PD77 GYALLP 
FDGir/NGKHN 

::£:Tv t lllvgf 



EF:G;. -3 (SEQ ID NO .395) 



TT TAGATGAAAA GAACCAAC 
TTCTGGGAAG AAACCAAAGC 
TGCATGAAAA AAGT7GATAA 
AAAGAACAAG TTATTCGCTT 
ACTGCGGAAA CTGGATTTAA 
GAAATCACAG GTGCTGAAGA 
GGCTATGGTA AGTTTGAAAA 
CCAGAAGGAT TTCAAAAGAT 



GACGAGTATG 
ATTAAGATGG 
AAGCGTTTGA 



CGAAGAGTGA 
TGACCGTTCC 
TGCTTTATGA 



tggaaagacg gaaataaaaa 
agatataact tgcatgaaat 
gaa™:acaa aagctgcgga 
acaagcgcaa cgttggcgaa 
accgctgaac aagttttgga 
gtaggctttg aagcaggcaa 
caaa^cgtca attgtacgat 
gatggttcgc aaacttttac 
acgcatgatg tactggatgg 
ccagatggta cgaacaaaga 
aaagaattta ccaaaaccgt 
actaagttta cttttacgga 
aatgaagatt tgaaagaaaa 
agtacgccaa aacaaccgga 
acagtgaaga 



TT 



TT GGCGTTAAA 
ACCTG/^j-iGGrt T 
TAA C G rij-i^rtAj-v ^ 
TGGCTTTGAT 
CGA TTTATGT 
TAAAGCGACG 
TCTTGCTTAT 
TACAGCACTA 
GTATGTCTTT 
TTAC 5 A. G AAA 
TTTGGGGGAG 
ATTG AAT AG ; _ 
CAAAGAAGAC 
AGAAAAAGAG 
£ AAA 3 AG AAA 
TAAAAGGATG 
7GAGCCAG7A 
TGAACGCDAT 
TCATGGTGAG 

ctcaaaagaa 
aatttggaaa 
acttggggaa 
aatcaatta: 
atctcaaacc 
aacacgaggt 



TT 



AAA. 



jTtt : 
re AG : 



jTG £ 



gtatgctgta 
tgttacggca 
gggtgatggc 
agggaccaan 

AGGT77TGAT 

ajvpagaggct 
ggaaaacaaa 

;,A.*vAGAACCA 
TGPTAGTCTG 
TGTTGCGACT 
TGATAAATTG 
TATTGATGTG 
GATTGCCGAA 
j CA.TAAATTA 
TG AAAAC AAG 

::aat:aagca 
g:ac:tagaa 

T jTGTCGGTT 

ta tgctttacta 
ga agtgaatgat 
ta 7£gagaagga 
aa t 3 g aaaac ac 
v£cataccg 
pgtagtccc 



;7gta 



SA 



EF1C2-4 (SEQ ID NO : 3 9 6 ) 



LDEKNQVA VKHLAINEYF WQETKAPEGY TLEETFY" 
IKKVDNNEKN AVITRDVTAK EQVI RFGFDF FKFAG. 
ITGAEDKATT ACNEQLGFDG ygkfenlpyg lyllf: 
dyakseyvf? iteegqkcpi kmvtvpyekl tn::ep. 
kdgijkklntl dftelvdklr ynlheikedw yvvar. 
tatlankekt gtwkilkklt aeqvddksiy lf;g;T- 
tvnctierhv siqtkahled gsqtfthgdv mdi-:fe 
dgtnkeiwks gkiehevndk eftxtvlaek vdt.=p 



AFTCF:;r:.SF kgsplsgtxe 
egfcki'iige irstfkenkd 
rlmlycl i ex ee sltslatw 
atkaage pte k akp wi aet 
afkagmfiva kdaslnnqaq 
hdvldgefea fetilyallp 
?-ftfte:::ye kogmvngkhn 
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TABLE 1. Nucleotide and Amino Acid Scqcucnc.-. . T I:.Jaccaiis Genes. 



EDLKEKSQTL TPKEVPTIPS TPKQPETPAV P": ' 



EF103-1 (SEQ ID NO:397} 



TAAGATAGGT 


TTATCAAAGA 


AAAGGAGCGA 




c: : 


. .TG A 




GT 


ATTAAGTTCG 


ATTACTTTAG 


TAACATTAAG 


TACGTTACTT 


A'; . 






ATGCAAGT 


GC 


AGCATTTGCA 


GATCATGCAG 


CCAATCCAAA 


TAGTGCTACA 


g:;- 






GC AAA CAT 


GA 


AAACAATGGC 


C AAA C AAG AG 


GCGACAAGGC 


GACTAAGATT 


rp-n - 






^nGA'"TG<~ 


"A 


AGGAACCCGT 


GTTTATGATG 


CTGCTGGTAA 


TGATTTAACG 








:taattt v i 


AT 


TGGTTTAGCA 


AAATATGATG 


GTGAAACCGG 


TTTTTACGAG 








AAAATACTGG 


GGAAACCCGT 


GGTGACGAAG 


GAACATTTTT 


TGTGACAGGT 






- r a a 


AACGAATTTT 


AATTTCGCGG 


ACACAAAATT 


ATCAAGCCGT 


AGTGGATTTA 






; ' i >■- : ^ 


GTAAAGA 1 : 


NfA 


ATTTACTTAC 


AAGCGTTTAG 


GGAAAGATAA 


ACTGGGGAAT 


g.-v: 






tttacgt: 


SA 


a:acatccct 


TATCATGGGA 


AAAAATTAGC 


TTTTACAAAT 


G J." 






:attaac-: 


AA 


g:aaactggc 


AAAATTGTGA 


CAAATAAATC 


AGGGGATAAA 


at'.: 






gaacgttc 


TG 


GAATGGCACA 


AAAGTCGTAG 


ATAAAAACGG 


TAATGATGTG 


a: a 




-■ .-<--■ ?\ 

- ■ ^ ^ . -i 


ATCAAAAT 


TT 


CATTAGTTTA 


GCGAAATTTG 


ATCCAAACAC 


AAGTAAATAT 


ga; 






ATTTA CAAAC 


AGGTGAAACC 


CGCGGCGACT 


TTGGGTACTT 


CCAAGTGGTG 


GAC 




/■.AC A 


AGATTCGG 


GC 


GGATGTATCT 


ATTGGTACGA 


ATCGTTACGG 


CGCGGCGCTA 


ga;- 


'T"'~ ;. 


■'■.CC-G 


AACTAAA'' 


AA 


TGATCGATTT 


ACGTATACTC 


GAATGGGTAA 


AGATAATGCT 


GG'I 




- ^ .~i 


TTCAA 3T'" 


TT 


ggtggaacat 


GAACCTTACC 


AAGGCACATA 


TCATCCAGCC 


TT'I 






AA 







EF103-2 (SEQ ID NO:398) 



MKKKVLSSI TLVTLSTLLI AGYASPAFAD HAAKi?:C 
TRGDKATKIL SGTDWQGTRV YDAAGNDLTA ENAWF 
DEGTFFVTGD GTKRILISRT QNYQAWDLT EG: v " 
HGKKLAFTNG REALTNQTGK IVTNKSGDKI LGTT! 
KFDPNTSKYE FFNLQTGETR GDFGYFQWD NNK I A 
YTRMGKDNAG NDIQVFVEHE PYQGTYHPAF TP 



yg;:;e:g;f.ef fdkntgetrg 
rl:;?;gkl:g^d vevyvehipy 
wd?;*.;gi-j:;vt aanqnfisla 



■L.E LTELNNDRFT 



EF103-3 (SEQ ID NO:399) 



TCATGCAG CCAATCCAAA TAGTGCTACA GC 
CAAACAAGAG GCGACAAGGC GACTAAGATT 
GTTTATGATG CTGCTGGTAA TGATTTAACG 
AAATATGATG GTGAAACCGG TTTTTACGAG 
GGTGACGAAG GAACATTTTT TGTGACAGGT 
ACACAAAATT ATCAAGCCGT AGTGGATTTA 
AAGCGTTTAG GGAAAGATAA ACTGGGGAAT 
TATCATGGGA AAAAATTAGC TTTTACAAAT 
AAAATTGTGA CAAATAAATC AGGGGATAAA 
AAAGTCGTAG ATAAAAACGG TAATGATGTG 
GCGAAATTTG ATCCAAACAC AAGTAAATAT 
CGCGGCGACT TTGGGTACTT CCAAGTGGTG 
ATTGGTACGA ATCGTTACGG CGCGGCGCTA 
ACGTATACTC GAATGGGTAA AGATAATGCT 
GAACCTTACC AAGGCACATA TCATCCAGCC 



A AT T " ■ ■" , G ^ C AAA CATC A AAA C AATGGC 







gggagtg: 


CA 


AGGAACCCGT 


GG.'-.G/:-- 




TTAATTTT 


WT 


TGGTTTAGCA 






AAAATACT 


'GG 


GGAAACCCGT 






AACGAATTTT 


AATTTCGCGG 


A" : 




GTAAAGA " 


NA 


ATTTACTTAC 






TTTA ZGT " 


-GA 


a:acatccct 


GGA G. ; . 




:attwc: 


AA 


TCAAACTGGC 


atttt;- 




CAACCTT" 


^TG 


G AATGGC AC A 


a:: a • 




/ATCAAAA" 


TT 


:attagttta 






ATTTACA : 


.AC 


AGGTGAAACC 


: j/.C.G-. . .■ 


..■ A 


AGATTCG-I 


;gc 


:CATGTATCT 






AACTAAA-: 


:AA 


TGATCGATTT 


GGTAAT . 




ttcaagt: 


:tt 


GGTGGAACAT 



EF103-4 (SEQ ID NO:400) 



HAANPNSATA NLGKHQNNGQ 

TRGDKATKIL SGTDWQGTRV YDAAGNDLTA EKANF . .AT. YDGETGF V EF FDKNTGETRG 
DEGTFFVTGD GTKRILISRT QNYQAWDLT E'G ' ". ■■' RLGKPKL^ND VEVYVEHIPY 



WO 98/50554 
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TABLE I. Nucleotide and Amino Acri . E. r.u\ * Ci ;nes 















At A 






K 7 Z P N T 3 K Y E 


FFN'L^TGETF. 






'-"'SI 








yyel; yacrfy 


YYPA^GKYNAG 


NEIC'VFVr. HE 


PYQ3TY:-:: A 














EE 1C4-1 ( SEQ IC KC:4C1 


1 














-GAAAGGGG" 


TTAGTA T~ AA 






AYTG 


A" AAAAA' 


TTIA- 


AG 


AAAAA ATTCYC 


G G A A AAAA T C 


TCGGGTTT A A 


rG - rtj-^-.T-..- . s r 




AY AAA 


: A; - '•' 


AC A. 


A A 


A G A A A AAA C A 


GAGAGGAGTG 




AAAAGAGG :A 


g;, 


. . AGA 


i : -.C . A--_ 


AAA- 


AA 


AYCTTTACCT 


TTAGCAA AAA 








AGTG 


:aaaa a 


AAA 


AA 


A AAA G AAAACC 




- 


x ,ccag.:";;,. 




.AAAA 




J ^ 


^ /-i 


AAA.A.ACAACG 


GAG AACAAGCA 




TAATGAA-^W. 




A AAG 






AC 


A CCAATTACA 




TTGACAGTAT 


GA3T"G r rG 13 






GAAAA 




YA 


■ AGA AAAAC A 


AAGGTT AACC 




ta:a.agA'L- :g 


GA AAA 


A 3YA 


AGYYA 




YY 


T'GCTGTTGGA 


AACACA AGTT 


ATA A ACCT A A 


at:agtytat 




:aat 










TAA'^TATCA 




ga:agg 'v.'-t 


G A A AA 


AG YY 


GG AAA 


ATA 


AA 


T'A A A ACGACC 


AAA AG ATT A A 


GTATTACGTA 


T A.A T ■„ > *. . 




A AT A 


••' - - : - 


prp' - 


,YA 


GTA AAACTA 


AAAAGTTA rp r > 


T AA G AA, CA jA 


gg:gga.ag':a 


A A .AAA 


-AAAA. 


YAGA..A 


AC A 


A A 


AAAAAA.AAAAT 


A A A A A A ACCT 


A G T G G TTTG A 


tttatatg;^ 




1AAG 


CAAYY 


_ 


YA 


a:aacgaacga 


AGG AGA ATGA 


uyuu j X X on 


TG jC^.^'vi . -l 


' ••' 


.A.AYT 




AG A 


AC 


: ccaavatggc 


AATCAAACAT 


TAGAATTATT 


aacaacaga.g 




AGCG 




TTA 


GG 


A„A.A-^CAAGAT 


AA^^TGGAAG 


GTG AAGTTTT 


ca^tta ggat 




ATT A 


AY AGA 


A AA- 


nT 


A'TTAACAGAA 




TGT^A A^ATA 1 


TGG C AAA 3 A T 


YAAAA 


YTAA 


G AG. AY 


.AAA 


T^ 


;-.CVC'2GGCGG 


ATTGCTGTA A 


rTGT^rr a aa 


C A TG AA T'Z A A 


AAAAA 


GGCT 


ATT AA" 


ATA. 


TC 


■AATT.AATCGG 


AG A A^^TATT 


TAi^ AG AGTG-" 


TTCGGACTA7 


AAAAA 


YYAT 


AYTAAA 


AAA 


A A. 


■TAT ACAACA 


AGAAAAATTG 


<3 — w /lii X X ^ ^ 


TTTGAAAAi Y 


AAAA A, 


.GGAA 


ATA.A.A 


AAA 


AC 


AAA A GG ATTTT 


AGTGCTAAGA 


GGAGT^AAA " 


AAG T AA. A : .T T A 


A-/T A ■ ' 


AATG 


AY A'lA 


ATG 


AG 


T'AGTATGTCC 


TATATGAGTT 


TT^ AAAGGAA 


AG 3GAAA.TA* Y 


f r>;, — 


ACAA 


TTAAY 


ag: 


AC 


ATTA.ACAGAA 


ACAAAAGTGG 


QT'Ai ^AA A^ 


CGYATYA AAG 


AA AA, A 


AACG 




GA.A 


A. T 


A AAAAATCCT 


AAAT^TACGG 


CGTATGGTAA 




A./ A "A" 


.AAAT 




ga; 




A'TTTGATATT 


AAAAC TGAAG 


GTGGC AAG T 


CACTYY/^A _G 


' - : ■ • 


AAAG 


A TAG- A 




YT 


AAG^TAAAT 


ATTTCTGATT 


TAACAA^GGA 


YYYTGAYAAG 


A A. .A • 


AYYA 


AT ATA. 


YCA 


YT 


A„A jTACACCT 


GTAATTGGTC 


CTAATAAAGG 


CAYTGA.AYYA 


G^ AAA A 


AATG 


aAT.'-lT 


AYA 




A. A AAATTAGT 


GTTGTTAATC 


CTTTGAATGG 


TGAAACYJ AY 




AAATT 


AT A AT 


AAA 


AA 


T : j A TGCC TAT 


TGATCAAGAA 


CAACTGTCTC 


AGYTATG 3GA 




- AAGA 


AA AAA 


at: 


A A 


A„AA TT T AG AA 


ATTAAAGTAA 


AGCATCCTAA 


TTATCTTTGA 


TYAC'G 


'-.GCTA 


CAAAA 


GAA 


AT 


TTATTTTTAT 


TACAAGTTAG 


GAACGGATTA 


TACAGTAAGG 


CC'AYvG 


. YCAG 


ATGGTTCA 


AT 


tattaagttc 


ACTACGCCAA 


TAACCAACGA 


AATCCAAATT 


.^Y" Z- A -t^ 


AGGTT 


TTAATTAY 


GT 


GAAAGATAGT 


TTGCCAAAAG 


ATAAAAGTAT 


CCCAGTGGAT 




AACGA 


TAACA 


ATG 


AG 


TGCTGAAGGT 


TTAAGTCCAG 


TTGATACGAC 


AGTAAGTA3T 


AA-.YAA 


: AAGC 


GTGGTTCA 


GA 


aagaacactt 


CAAAGTAGTA 


AAAATGAATT 


CCTTijTC AAT 


AAAAC 


' AATG 


ATTCT 


TTA 


GA 


ctaactaagc 


GTCCGTACAA 


AAATTCCAGC 


tgggg:y.iat 


GYYGA" 


'TTTG 


ACATTTAA 


AAA 


■tatttcaaac 


GATCAGGTAG 


ATTCAATTTA 


TCCACAAYAG 


YAAAA 


" AGCG 


GTCAA 


ta: 


YT 


taataaacca 


ATGACGCCAA 


ACAGCCCTGG 


ataty:aa::g 


A A A A 


TTTG 


ACGiV 


-AAA 


A. A 


a;^tagttac 


AGGTTTGATT 


TTGGAAAAAC 


CAAGAAAAGY 


YA A, AA 


" ATTG 


AGTAY 


AAA 


AA 


A A ACAATGGC 


TGGATCGACG 


TGCCAACTCT 


ttatat;w\ :a 


GA' '.. .A 


A 3CGA 


AAGAA 


ac; 




ATAGAATAAT 


AATGAAGGCT 


CTGCTTCGGT 


ttctgtyaaa 




-", ."5CGT 


TAG A A 


at: 




AA ATGCAACA 


CAAGCGGCGA 


ATCCAACATT 


AAAAAAY3TA 




AACGA 


cagt; 


ac; 




A-AAAAA T ATT 


GATAATAAAA 


CACATCGTGT 


gaaaaay:ya 




_ AAAT 


TAACA 


ac; 


AA 


A AGCACAACC 


AATGC TCAAA 


TCGATTTGAA 


tyctayyacg 




-AGCG 


tgcc; 


AAA 


GA 


T j ATTATTCA 


TTAGAGAAGA 


CTACAAACGG 


tgcgaaagtg 


A.YTT 


"AAAG 


ACTA. A 


ac; 


TT 


A AC AG AAAAC 


ATTACGATTG 


AATACAATAC 


GGTCTCY^Y'A 




AGGCC 


AAATC 


TAT 


A A 


A AAAACAACA 


ATCGACTCTG 


AAACATTGAA 


CCAGATGTGY 




^ .AAGA 


AAAAA 


ATC 




CACTGCGCCA 


ATCACATTGA 


AATTCTCAGA 


aggtgatgag 




: ATTG 


yytayyt; 




AACTGCCACA 
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TABLE 1. Nucleotide and Amino Acid Scqciieiic. s of E. faecal is Genes. 



TTCTACACGC 


ATAACGTAGA 


GGATGAAAAC 


caac 


.ATTG 


CGAAGGTTTC 


TTTTGAACTA 


ATTGATAATG 


TCACGCATAC 


AGCAACCGAA 


r T i— .- - ■ 


■ACAG 


ATGAAAAAGG 


TCAATACTCC 


TTTGATGCCA 


TCATGACAGG 


TGATTATACT 


TTOOC : 


OGTAA 


CGAATGTA 0 0 


GCAGGAATAT 


TCCGTGGATG 


AAGAGTATTT 


GACAGGAAAA 




OAAGC 


TGGTCAAAGG 


AGACAACCAA 


CTAAAAATTC 


CATTAACGAA 


AACAATTGAT 


CACAO- 


"CGTT 


TACAAGTCAA 


AGATTCAACG 


ATTTATGTCG 


GCGATTCATG 


GAAACCAGAA 


GAGA."-. 


ctttg 


TTTCAGCAAC 


AGATAAAACA 


GGTCAAGACG 


TTCCCTTCGA 


AAAAATCACT 


GTTT. . 


,\GCTC 


AAGTTGATAA 


CANCAAAGCA 


GGCGTTTATC 


CAATTATTTA 


CAGTGACGAA 


GGTAO 


AGAAG 


AAACAGCCTA 


TGTGACCGTC 


AAACCCGACC 


AATCTAAGTT 


AG AGGTC AAA 


GATA :. 


■ ACGA 


TTTATGTTGG 


TGATTCGTGG 


AAACCAGAAG 


ATAATTTCGT 


TTCAGCGACA 


oca;- 


■-AC AG 


GTCAAGAC3T 


RCCGTTTGAA 


AAAATTGATG 


TTCAGGGAAC 


AGTGAATGTT 


G 0TA-' 


A TAG 


GCGATTA7CA 


AATTGTCTAT 


AAAAATGGCA 


NAAAAGAAGC 


GAAAGCAATC 


GT7C : 


■'GTCC 


GTGATGACAG 


TCAGTTAGAG 


GTTAAAGATA 


CAACGATTTA 


TGTTGGT3AT 


tccc. 


AAAC 


CAGAAGA7AA 


TTTCGTTTCA 


GCAACAGACA 


AAACAGGCCA 


AGACGTTCCG 


TCTG.-" 


-A AAA 


TCACTGTTTG 


AGGTCAAGTT 


GATACTAGCA 


AAGCAGGCGT 


TTATCCAATC 




OAGTT 


ACGAAGGTAA 


AGAAGAAACA 


GCTAATGTGA 


CTGTCAAACC 


CGACCAATCT 


A AG 70 


AGAGG 


TTAAAGATAC 


AACGATTTAT 


GTGGGCGATA 


AATGGGAACC 


AGAAGATAAT 


Ti cc: 


TTCAG 


CAACAGA 0 A A 


AACAGGTCAA 


GATGTCCCGT 


TTGAAAAAAT 


TGACGTTCAG 


GGOA'. 


AGTGA 


ATGTTGATAA 


AATAGGCGAT 


TATGAAATTG 


TCTATAAAAA 


TGGCACAAAA 




OAAAG 


CAATCGT7CA 


TGTCCGTGAT 


GACAGTCAGT 


TAGAGGTCAA 


AGATACAACA 




CGTGG 


GTGATAAATG 


GGAAGCAGAA 


GATAACTTCG 


TTTCCGCGAC 


AGACAAAACA 


gc ro," 


■GACG 


TTCCGTTTGA 


AA AAA TTG AT 


GTTCAGGGAA 


CAGTGAATGT 


TGATAAAATA 


GG 0 A~ 


CTATG 


AAATCGTOTA 


TAAAAATGGC 


ACAAAAGAAG 


CGAAAGCAAT 


CGTTCATGTC 




.OGATA 


GTCGTTTACA 


AGTCAAGGAT 


ACAACGATTT 


ATGTCGGCGA 


TTCNTGGANA 




.,, ?NGA 


ACTTTGTCTC 


AOCNACAGAT 


AAAACAGGTC 


AAGATGTCCC 


ATTCGAAAAA 




'CGTT 







EF104-2 ( SEQ ID NO:402) 

MKKKTF SFVMLSILLA QNFGFAVNAY AVTTOTA. " " TTDTAKKEAE CSNSTPSLPL 
ATTTTSEMNQ PTATTESQTT EASTTASSDA AT PS EC *'TTE DKDTSLNEKA LFDVQAPITD 
ELLDSMSLAP IGGTEYSQTE VHRELNTTPV TATFC ."■ AVGN TGYAPGSVYT V£-LPEHLGYS 
TVSGEVTGIG ATWAVDAATK TLSITFNQRV S07SO-VELK SYLTTEAEFL J Y IETPGKNK 
KTYSFDLYEQ VEPIQYNERT RTTGLDGEIF YOCCCCLTGN QTLELLTTET PGAVFGKQDN 
LEPQVFSYDV DINGQILPET QTLLTPGKDY TiSCCfLGRI AVTVPNMO'CQ KAYSLSINRT 
IYLESASDYN YLYSQQYPTT KIGSISLKST TOTKCATDFT AKTS;>T£CVI ADREMRSMSY 
ISFQSKGKYY VTIYGTLTET KVGQQIVLES C: 0 ; 0 " KNPK FTAYC4PLYEN VKLEDYFDIK 
TEGGKLTLTA TKDSYLRINI SDLTMDFDKK OIOCO; STPV IGPNKAI^LV SDQYIEPISV 
VNPLNAETAW GNYDQNGAYS SRTTVSVMGS K ZC' P ; : IJL.EI KVKHPNYI ?L FATKEIYFYY 
KLGTDYTVTP TSDGSVIKFT TPITNEIQIP I OFHYYPDSL PKDKSIP'.'DT I PITMSAEGL 
TPVDTTVTTN SKRGSERTLQ SSKNQFLVNA FOCSFI SLSV RTKI PAGAEV LFDIYDVSND 
QVDSIYPQYW DRGQYFDKPM TPNSPGYPTI TfCKMCHSYT FDFGKTNPRY 1 1 EYKNANGW 
IDVPTLYITG TAKEPQSNNN EGSASVSVQN EALDILSATQ AANFTLKI rVT KTTVTTKNID 
NKTHRVKNPT IELTPKGTTN AQIDLNSITV KGYPELAYSL EKTTHGAC VI FKDYTLTENI 
TIEYNTVSAN AGQIYTETTI DSETLNQMSA SKKKVTTAPI TLKFSEGOAE GIVYLATATF 
YTHNVEDENQ AIAKVSFELI DNVTHTATEF 77DEKCQYSF DAIKTGDYTL FVTNVPQEYS 
VDEEYLTGKA IKLVKGDNQL KIPLTKTIDH CI-.CCO^-DSTI YVGESWKOEE KFVSATDKTG 
QDVPFEKITV SGQVDNXKAG VYPIIYSDEG K E E T A *: "TTVK PDQSKLE' OKD TTI YVGDSWK 
PEDNFVSATD KTGQDVPFEK IDVQGTVNVD KC CCYEIVYK NGXKEAKAIV KVRDDSQLEV 
KDTTIYVGDS WKPEDNFVSA TDKTGQDVPF E0TTCC3QVD TSKAGVYCIV OSYEGKEETA 
NVTVKPDQSK LEVKDTTIYV GDKWEPEDNF WAT! KTGQD VPFEKID*'QG TVNVDKIGDY 
EIVYKNGTKE AKAIVHVRDD SQLEVKDTTI Y'/CCCVjEAED NFVSATDCTG QDVPFEKIDV 
QGTVNVDKIG DYEIVYKNGT KEAKAI VHVR E^EAjVKDT TIYVGDS'OXP EXNFVSATDK 
TGQDVPFEKI TV 
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TABLE 1. Nucleotide and Ammo Acni 5eq-.-: ■: ■ . :> of E. face*:!: - G :ie- 
ef:c^-3 ace; ig f r: : 4 : .■ 



TGTAA CAACGACAGA AGAACAAACA 

* in jL/viCArt Qn 1 . T ACTTC j-. G A-- A ~.- . . » .—-e-.- — 3 CAAC.- ..--. A ^ A-. .-. ~ AG CAAACC 

gag jAcaagg acacctga:t gaataaaaa/-, a a a a /tag atgtgaaaaa gaaaattaca 
gataaactac ttgacagta7 gagttttgag c a aav tggtg ga.icagaj.ga :agggaaaca 
gag attcacc gcgaattaaa tacaaca-cg gaaaaacgata cgt7c gaa.gg tg a7gttgga 

AACACAGGTT ATG A AC ACGG ATCAATT7A7 AAAAATAAAT TACCAGAAAA TTTAGGTTAT 
TCAACTGTCA GCG3AGAAG7 GACA 3GCATT GAG.- 'AAA VTA" GGGCAGTCGA TGCGGCGACC 

aaaa:attaa GTATTAAGTT TAA7CAACGA gtataagata cttcgttaag, agtagaacta 
aaaagttatc tan aaa gag a gg aggaa' gaa ';g' a a" caaaa ttgaagvaa go aggaaaaaat 
aaaaaaacct acta atttga tttatatgaa :aag:ggaag caattcaata taacgaacga 

ACGAGAACGA AGGGA7IVAGA TGG V AAAAAC^ G 777 A7AAC7 TAG A A AG A AG G AAAACTGGC 
AAT-AAACAT 7AAAA77AA77 AA I AACAGAG " ' A ."AGG GTGT A 777 GG A.AAACAAGAT 
AAC7TGGAAI A GG AAGT777 CAGT7ACGAT G L AAA C ATT A ATGGA'CAAAT TATACCAGAA 
ACGCAAACCT TGTTAA "AGG TGGAAAAGAA 1 " AGA:\A77AA GC GAT AAA 7C AOTCGGGCGG 
ATTACTGTAA ATGTTCCAAA CATAAATCAA AAAAAGvA TC7 ATTCC TTA7A GA77AATCGG 
ACAATTTATT 7AGAGAGTGC TTAGGACTAT AG*. AAAA ATTAT ATTCGGAG- A A AAA7CCAACA 
ACAAAAATTG GGTCAATTTC TTTGAAAAGT AGGA AAGGAA CTAAACAAAC AG,G AGATTTT 
ACTGCTAAGA CGAGTCAAAC AACTAAAGTA A7AGA7GA7C GTGA.AATG A." 7AG7ATGTCC 
TATATCAGTT TTCAAAGCAA A G G G AAA T A 7 7 A. 7 G7AACAA TTTATGGAAV AA^TAACAGAA 
ACAAAAGTGG GTCAACAAAT CGTATTAGAG AGGA G AAA CG GTCA.AGAA.AT 7AAGAATCCT 
AAATTTACGG CGTATGGTC: TTTATAT AAA AGAAAAAAAAT TGGAAGACTA 7777GATATT 
AAAACTGAAG GTGGCAAG AT CACTTTAA AG A' A . AAAAG ATAGATAA 77 AAGAATAAAT 
ATTTCTGATT 7AACAATGGA 7TTTGAGAA A AAA A ACAT7A ATCTATCA.7C AA V-TACACCT 
GTAATTGGTC CTAATAAAGC CATTCAATTA ' AAA' 'GGA7C AATAA ATA GA ACAAAATTAGT 
GTTGTTAATC CTTTGAATG: TGAAACTGAT A -V T.AA77 ATG A A AAA AA-. TGA7GCCTAT 
TCATCAAGAA CAACTGTCTC AGTTA7G AGA .AG A. .AGAGA AACCG ATTCA AAA T TT AG AA 
ATTAAAGTAA AGCATCGTAA TTATC77T AA 77ACAAGCTA CAAAA GAA AT TTATTTTTAT 
TACAAGTTAG GAACGGATTA TACAGTAACG AAAA " GTCAG ATGG7TCAGT 7ATTAAGTTC 
ACTACGCCAA 7AACCAACGA AATCCAAATT CCAA r VTGGTT TTAA7TATGT GACAGATAGT 
TTGCCAAAAG ATAAAAGTAT CCCAGTCGAT ACGA7ACCGA TAACAATCAG TG ATGAAGGT 
TTAACTCCAG TTGATACGAC AGTAACTA ZT A A AW -TAAGC GTGGA TCA GA ACGAACACTT 
CAAAGTAGTA AAAATCAATT CCTTGTCAAT GCA.A AAAATG ATTCTTTTGA CTC AC TAAGC 
GTCCGTACAA AAATTCCAGC TGGCGCGGAT G7TC7TTTTG ACAT7TATGA TG TTTCAAAC 
GATCAGGTAG ATTCAATTTA TCCACAATAT 7GGGACCGCG GTCAA TACIT 7GATAAACCA 
ATGACGCCAA ACAGCCCTGG ATATCCAACG A77A :TTTTG ACGAAAATAC C.AATAGTTAC 
ACGTTTGATT TTGGAAAAAC CAACAAACGT 7ACA7TATTG AGTAA AAAAA CC-VCAATGGC 
TGGATCGACG TGCCAACTCT TTATATAAAA GGGA :AGCGA AAGAA.CCAAA ATACAATAAT 
AATGAAGGCT CTGCTTCGGT TTCTGTTTAA AA7GAAGCGT TAG AC ATT 7T GAG7GCAACA 
CAAGCGGCGA ATCCAACATT AAAAAATGTA AAA A AAACGA CAGTAACAA : AAAAAATATT 
GATAATAAAA CACATCGTGT GAAAAATCCA AG GA7TGAAT TAACA.CCAAA AA A A ACAACC 
AATGCTCAAA TCGATTTGAA TTCTATTAC Z G7AAAAAGGCG TGCCAGAAGA TG VTTATTCA 
TTAGAGAAGA CTACAAACGG TGCGAAAGTC A7T77TAAAG ACTA7ACA77 GAGAGAAAAC 
ATTACGATTG AATACAATAC GGTCTCTGCA AACGATGGCC AAATGTATAA AGAAACAACA 
ATCGACTCTG AAACATTGAA CCAGATGTCT CGAA-CTAAGA AAAAA GTCAA CAATGCGCCA 
ATCACATTGA AATTCTCAGA AGGTGATGAG GAAG ATATTG TTTATTTAGC AAATGCCACA 
TTCTACACGC ATAACGTAGA GGATG AAAA C GAGvG AAATTG CGAAGGT r I'Tr TTTTGAACTA 
ATTGATAATG TCACGCATAC AGCAACCGAA T7TAAAACAG ATG AA AAA- GG TC AATACTCC 
TTTGATGCCA TCATGACAGG TGATTATACT A ." A A" AGTAA CGAATGTA A Z GAAGGAATAT 
TCCGTGGATG AAGAGTATTT GACAGGAAAA GAGA A TAAGC TGGTCAAAGG AGACAACCAA 
CTAAAAATTC CATTAACGAA AACAATTGAT GAGAA A7CGTT TACAA.GTC.AA AGATTCAACG 
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ATTTATGTCG 


GCGATTCATG 


GAAACCAGAA 


GA3AA 


TTTG 


TTTCA ACA/ 




AGATAAAACA 


GGTCAAGACG 


TTCCCTTCGA 


AAAAATCACT 


gtttc 


.GCTC 


AAGTTCAT, 


A 


CAJICAAAGCA 


GGCGTTTATC 


CAATTATTTA 


CAGTGACGAA 


GG7A A 


.gaag 


AAACA ACCA 


'A 


TGTGACCGTC 


AAACCCGACC 


AATCTAAGTT 


AGAGGTCAAA 


gaia : 


-A C CA 


TTTAT ATT." 




TGATTCGTGG 


AAACCAGAAG 


ATAATTTCGT 


ttcagcgaga 


GACAA 


AC AG 


GTCAAA-AC 




NCCGTTTGAA 


AAAATTGATG 


TTCAGGGAAC 


AGTGAATGTT 


gata \ 


A AT AG 


GCGATTAT 


A 


AATTGTCTAT 


AAAAATGGCA 


NAAAAGAAGC 


GAAAGCAATC 


GTTC A 


ACTCC 


GTGATAAC 




TCAGTTAGAG 


GTTAAAGATA 


CAACGATTTA 


tgttggtgat 


TC ATA 


;aaac 


CAGAAGAT 


A 


TTTCGTTTCA 


GCAACAGACA 


AAACAGGCCA 


AGACGTTC C C 


TTTGA 


AAAA 


TCACTCTT 






GATACTAGCA 


AAGCAGGCGT 


TTATCCAAT 


GTTT A 


AGTC 


ACGAAGGT. 


A 


A ■ A A A G AAAC A 


GCTAATGTGA 


CTGTCAAACC 


cgacgaat:t 




AGAGG 


TTAAAGAT 




AACGATTTAT 


GTGGGCGATA 


AATGGGAACC 


AGAAGATAAT 


ttggc 


:tcag 


CAACA AAC. 


A 


AAGAGGTCAA 


GATGTCCCGT 


TTGAAAAAAT 


TGACGTTCAG 


ggaa: 


AGTCA 


ATGTTGATAA 


AATAGGCGAT 


TATGAAATTG 


TCTATAAAAA 


TGGCACAAAA 




AAAAG 


CAATC ATT 


A 


TGTCCGTGAT 


GACAGTCAGT 


TAGAGGTCAA 


AGATACAACA 


attta 


TGTGG 


GTGATAAA 




GGAAGCAGAA 


GATAACTTCG 


TTTCCGCGAC 


AGACAAAACA 


GGTCAAGACG 


TTCCGTTT- 


A 


AAAAATTGAT 


GTTCAGGGAA 


CAGTGAATGT 


TGATAAAATA 


GGCCA 


TTATG 


AAATTATC 


:a 


TAAAAATGGC 


ACAAAAGAAG 


CGAAAGCAAT 


cgttcatgt: 


CGT jATGATA 


GTCGT ATA - 


"A 


AGTCAAGGAT 


ACAACGATTT 


ATGTCGGCGA 


TTCNTGGANA 


CCA3A 


AGNGA 


ACTTTATT 


ac 


AG CNACAGAT 


AAAACAGGTC 


AAGATGTCCC 


ATTC 













EF104-4 (SEQ ID NO:404) 
VTTTEAQTE TTDTAKKEAE LSNSTPELPL 

ATTTTSEMNQ PTATTESQTT EASTTASSDA ATPS" E'-X TTE DKDTSLNF A A L PC VQAPITD 
ELLDSMSLAP IGGTEYSQTE VHRELNTTPV TATFvFAVGN TGYAPGSV T YQAFEHLGYS 
TVSGEVTGIG ATWAVDAATK TLSITFNCFV SFTSFXVELK SYLTTEAEAL 1 K I ETPGKNK 
KTYSFDLYEQ VEPIQYNERT RTTGLDGEIF YKLC F.TLTGI1 QTLELATTAT I GAVFGKQDN 
LEPQVFSYDV DINGQILPET QTLLTFGKDY TLSCNSLGRI AVTVFI3MKA. Q KAYSLSINRT 
IYLESASDYN YLYSQQYPTT KIGSISLKST TGTKX.TTDFT AKTSC^SKVI AGREMRSMSY 
ISFQSKGKYY VTI YGTLTET KVGQQIVLES TI\G£EIKK?K FTAYGALYGN YKLEDYFDIK 
TEGGKLTLTA TKDSYLRINI SDLTMDFDK E IG ICLF EF7PV IGPNFAI C ~V SD;YIEPISV 
VNPLNAETAW GNYDQNGAYS SRTTVSVKGS KEAFTjNLEI KVKHPilYFAL FATKEIYFYY 
KLGTDYTVTP TSDGSVIKFT TPITNEIQIP 2 GFNYVFDSL PKDKSIPVAT 3 Fl TMSAEGL 
TPVDTTVTTN SKRGSERTLQ SSKNQFLYKA F.IAA SFDFLS'V RTKIPAGAAV LFCTYDVSND 
QVDSIYPQYW DRGQYFDEPM TPNSPGYPTI TFC EKTIISYT FDFGKTNFEY 1 1 EYKNANGW 
IDVPTLYITG TAI'vEPQSNNN EGSASVSVC.TI EAEEZ Lt-ATQ AANPTLK^.T KTTVTTKNID 
NKTKRVKNPT IELTPKGTTN AQIDLNSI7V KGVFEDAYSL EKTTI1GAFAAI FFDYTLTENI 
TIEYNTVSAN AGQIYTETTI DSETLN£M£A SFFFVTTAPI TLKFSEGDAE GIVYLATATF 
YTHNVEDENQ AIAKVSFELI DNVTHTATFE TTOEKGQYSF DAIMTGDYTG FA'TNVPQEYS 
VDEEYLTGKA IKLVKGDNQL KIPLTKTIDH SFLQVKDSTI YVGDSVJKPEE IJFVSATDKTG 
QDVPFEKITV SGQVDNXKAG VYPIIYSDFG KEE7AYV7VK PDQSKLEVKD TT IYVGDSWK 
PEDNFVSATD KTGQDVPFEK IDVQGTVNVD K 1 GIAYEI VYK NGXKEAKAAV HVRDDSQLEV 
KDTTIYVGDS WKPEDNFVSA TDKTGQDVPF EEITVSGQVD TSKAGVYPTV YFYEGKEETA 
NVTVKPDQSK LEVKDTTIYV GDKWEPEDIIF VA AATjKTGQD VPFEKIDVAC TVIJVDKIGDY 
EIVYKNGTKE AKAIVHVRDD SQLEVKDTTI Y7GCFWEAED WFVSATDK r "G QDVPFEKIDV 
QGTVNVDKIG DYEIVYKNGT KEAKAIVHVR ddsreqvkdt TIYVGDSWXP EXIIFVSATDK 
TGQDVPF 

EF105-1 (SEQ ID NO:405) 

TAAATGAAAA AAACAGTCGT CTACTCCTTG TTATTCGGAA CAATGTTG AT TGGGGCCACT 
GTTCCTGCTG AAGCGGCGAC GGTCGTTTTT GATAGCGAAC AGTCGATTGT TTTTACCCCA 
AGCACAGATG GGACGGATCC AGTAAATC Zh GAAAATCCCG ATCCAGAAAA ACCAGTTCGA 
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TABLE 



Nucleotide 



Ami: 



r.:/:v C 



-.77ATG 



537C AA7777CTAA TG 

gcc7T7a:7g agcccagtgt t: 

accggcgccg gaac3tggat c/- 

CAACAAGGAC AGCAAGTAAA 7g 

AAAACCCCTA AGGA7GCAG7 AC 

CCAGTAAATA ATGGAGGGAA A r I 



"A - CGAATAGw 

. .fV-r ^, >-Lrvo TLj .* i_- . . . 

!7G GC7GGG7 7 7" 

■A -5 AATT A/*. A . : - 

" A 7 A A AAAG 7 7 : 7 '' 

:aa tcttactaa;: 




EF105-2 (SEQ ID NO:4G6) 



MK K 7vVY S L 7 F G TM 1 LGATV F AE A7 

YDPTNPDGPN PG7PGPLSIC YA77: 

YVQYSDLRG7 N AGWY LKVXQ ICG. 7:- 

NIQLCAAGAE TVVMQAPEXT GAG77 

TPKDAVQYKT TLTWLLSDVP VXTCGG 



tdgtdf"- v 7e 
araqty77y c 
ftepsyfg^a 

qgqqvka: : r r 



j ekpvrp 
:;:latan 

A? TATA 

■ :/rvPGK 



EF105-3 (SEQ ID NQ:407} 



GGCGAC GGTCGTTTTT GATAG7GAA 1 
AGCACAGATG GGACGGATCC AGTAA 
CCAGTCGATC CAACGAATCC TGATG- 
GATTATGCCT CAAGTTTGGA TTTTG- 
TTTGCCAGAG CGCAAACCTA TAG A.' 
AATTATGTAC AAGTAAGTGA TTTAv 
CAAAATGGTC AATTTCGTAA TGCA'C 
GCCTTTACTG AGCCCAGTGT T2G2". 
GC A A AC ATT C AA TT A G A TG C TZZZ 
ACCGGCGCCG GAACGTGGAT CA7G: 
CAACAAGGAC AGCAAGTAAA TGGC; 
AAAACCCCTA AGGATGCAGT AC 



ATCCA 



-GATTGT T77TACCCCA 

GAAAATCCCG ATCCAGAAA 

AA TCCAGGTA CCCCTG AG 

AATGAGATAT CGAATAAG7 

GATGGTTCA 2 CAAGTGA77 

ACCAATGCTG GCTGGG77 I 

7 /A I A r>*„AA S AATTAA A-*.- 

GCGACGGACS TATTGC :GC 

7AAa JI'j'.'j'j TCATGCmA^ 

3 G G 7 AAG CAG AAAAAG 7G ; 

AC AC GGG CAA TCTCAC7A; 



acca7ttcga 
a7tttccatc 
7 7aaacgtat 
ggcaactgct 
aaaa- 5tgaaa 
cggc a :cgtc 
aactgctacc 
::::agaaaag 

CG AAAAAAAT 

tgtt.:ctggg 



EF105-4 (SEQ ID NO:408) 



ATWFD SEQSIVFTPS TDGTDPVNPE NPDPEKPVR? 

VDPTNPDGPN pgtpgplsid YASSLDFGSN EISNKDQTYF ARAQTYRK 



GSASELATAN 



YVQVSDLRGT NAGWVLKVKQ NGQF RNAFT7 HKELTGATVA FTEPSVRSMA TDVLPPTATA 
NIQLDAAGAE TWKQAPEKT GAGTOITLWG QAEKV7EKNQ QGQQVNA7IT RAIS7TVPGK 
TPKDAV 

EF10 6-1 (SEQ ir NO: 409) 



TAGTCGTTTA 7GAAGAAAAA AATC777GG7 ACAA77ACC7 TGTTGG Z 
TTAGTTGGTG GAGCAGGAGG GGCT77GACG GCAGAAGCA7 ACGTTC Z 
AATCCCAATA ATTTAGGGGA 7TTA777GA3 7ATT7ACG77 CAGTTG Z 
GAAGGATTAT CAGAAAAAGA TTGGGC7GGA A7ACGCGTTT ATGATC7 
TTAACAGATG AAAATCAAAA CCTATTACAT GCAATCAAA7 TTGATG: 
TATGAATTTT TTGATAAAGA GACT3GAGAA TGAACAGGAG ATGAAG7 
ACCGCTGGTA TTACAGATGT TTCCCGTCTT GTAATTATTT CTGAAA: 
GGTGTATACC CACTTAGAAC TTTATACCAA GATACTTTTA CGTATA: 
GATAAAAAC G GAAATGATAT TGAA7T77TC GTAGAAAACA AAGCAAC 



777 AAGTGCGTTA 
I"CA AAGCGTAGAC 
7 AT TAG A 7 AAGAT 
'■.AA TGGGAATGAC 
AAC CACTAGTTTC 
AAC CTTCTTTATG 
C.AA AAATTATCAA 
AC A G A TG j GG AAA 
7 TC AGGAGCAGTT 
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TATGGTCGTC CGCAGCCATA CCCCAATAAT CGTCCCAGAA CACTAGAATT CACGAATGGA 

CGCCGTGCCA TGACAGAACA AACAGGCCAG ATTGATGTAA ATCGACAAGG GGATGAAATT 

ATTGG TAAAA CTTCCTTTGA TGGGACACCG CAACTTCTTT GGAATGGC AC AAAAGTAGTG 

GATAAAGATG GCAATGACGT AACTTCGGCC AACCAAAACT TTATCAGCTT AGGGAAATTT 

GACCAAGATA GCAGCAAATA TGAATT CTTC AATTTACAAA CTGGTGAAAC TCGTGGCGAC 

TATGGCTACT TTAAAGTAGG AAATCAAAAT AAATTCCGTG CCCATGTTTC CATTGGAACC 

AATCGCTATG GCGCTGTCTT AGAGTTAACA GAATTGAATG ATAATCGTTT TACGTACACA 

CGAATGGGTA AAGATAACGA AGGAAACGAT ATCCAAGTCT ATGTGGAACA TGAACCATAC 
CAAGGAACTT TTAATCCTGA ATTTACCTTT TAA 

EF106-2 {SEQ ID NO:410) 

MKKKIVGT ITLLALSALL VGGAGGALTA EAYVPQSVDN PNNLGDLPPY LRSVGIRQDE 

GLSEKDWAGT RVYDRNGNDL TDENQNLLHA IKFDATTSFY EFFDKETGES TGDEGTFFMT 

AGITDVSRLV IISETKNYQG VYPLRTLYQD TFTYRQMGKD KNGNDIEVFV EHKATSGPVY 

GRPQPYPNNR PRTLEFTNGR RAMTEQTG2I DVNRQGDEII GKTSFDGTPQ LLWNGTKWD 

KDGNDVTSAN QNFISLAKFD QDSSKYEFFN LQTGETRGDY GYFKVGIJQUK FRAHVSIGTN 
RYGAVLELTE LNDNRFTYTR MGKDNEGNDI QVYVEHEPYQ GTFNPEr TP 

EF106-3 (SEQ ID NO-.411) 

AT ACGTTCCTCA AAGCGTAGAC 

AATCCCAATA ATTTAGGGGA TTTACCTGAG TATTTACGTT CAGTTGGTAT TAGACAAGAT 
GAAGGATTAT CAGAAAAAGA TTGGGCT'CGA ACACGCGTTT ATGATCGAAA TGGGAATGAC 
TTAACAGATG AAAATCAAAA CCTATTACAT GCAATCAAAT TTGATGCAAC CACTAGTTTC 
TATGAATTTT TTGATAAAGA GACTGCAGAA TCAACAGGAG ATGAAGGAAC CTTCTTTATG 
ACCGCTGGTA TTACAGATGT TTCCCGTCTT GTAATTATTT CTGAAA CCAA AAATTATCAA 
GGTGTATACC CACTTAGAAC TTTATACCAA GATACTTTTA CGTATAGACA GATGGGGAAA 
GATAAAAACG GAAATGATAT TGAAGTTTTC GTAGAAAACA AAGCAACCTC AGGACCAGTT 
TATGGTCGTC CGCAGCCATA CCCCAATAAT CGTCCCAGAA CACTAGAATT CACGAATGGA 
CGCCGTGCCA TGACAGAACA AACAGGCCAG ATTGATGTAA ATCGACAAGG GGATGAAATT 
A TTGG TAAAA CTTCCTTTGA TGGGACACCG CAACTTCTTT GGAATGG CAC AAAAGTAGTG 
GATAAAGATG GCAATGACGT AACTTCGGCC AACCAAAACT TTATCAGCTT AGCGAAATTT 
GACCAAGATA GCAGCAAATA TGAATTTTTC AATTTACAAA CTGGTGAAAC TCGTGGCGAC 
TATGGCTACT TTAAAGTAGG AAATC AAAAT AAATTCCGTG CCCATGTTTC CATTGGAACC 
AATCGCTATG GCGCTGTCTT AGAGTTAACA GAATTGAATG ATAATCGTTT TACGTACACA 
CGAATGGGTA AAGATAACGA AGGAAACGAT ATCCAAGTCT ATGTGGAACA TGAACCATAC 
CAAGGAACTT 

EF106-4 ( SEQ ID NO:412) 
YVPQSVDN PNNLGDLPEY LRSVGIRCDE 

GLSEKDWAGT RVYDRNGNDL TDEN£PLLHA IKFDATTSFY EFFDKETGES TGDEGTFFMT 
AGITDVSRLV IISETKNYQG VYPLRTLYQD TFTYRQMGKD KNGNDIEVFV ENKATSGPVY 
GRPQPYPNNR PRTLEFTNGR RAMTEQTGQI DVNRQGDEII GKTSFDGTPQ LLWNGTKWD 
KDGNDVTSAN QNFISLAKFD QDSSKYEFFN LQTGETRGDY GYFKVGNC'CK FRAHVSIGTN 
RYGAVLELTE LNDNRFTYTR MGKDNEGNDI QVYVEHEPYQ GT 

EF107-1 (SEQ ID NO:413) 

TAAAAAACGG CACTCAATAT GTCAAAATTT GAAATTTCAA GCTGTGCGTT CTTTGGTAAA 
ATANATANAA AAATGCTAGT TATCAGTATC GATAATAACA GGATACTCAT TAAGAAAGGA 
CTTTATAGAG ACTATAGATT GAATTTTTAC ATAGAAAGAA GGAGCAAGAT GAAGCGAGTA 
AATTGGAAAA GATGGCTAGT TGTTGGGTTA AGTTGTTCTT TGTTCATGGA TTCAGTGGTT 
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TABLE I. Nucleotide and Ammo Acid Se 


qcuences of a. 


lac calls ( ' 






GG TGTG AC TG 




AACGA77AC7 


ggggcgacgc 


agcaagg; 


. : : 


AGC AACATCT 


CAGTCGAGTG 


A CGAA Z CGAG 




/ iAj-\ '„ >*_-1. ^- 'v_ C 


■H-'-.'kjAsJ - Z ; 


:a 


CCCA.ACGGTC 


GCTAGTGAAG 


■~ 'j/i/Vlrt _ A J 7 




GAAACGGCAA 


GAAT7GCT 


^ 


7 ::gag:gatt 


GGTTA77C77 


C 7G 7G G AAGG 


GCG C j AG ATT 


CCCC777TCC 


TTGTGGAG 




AGAC 3GGACG 


TTGTTTGATC 


CCGACCGAAT 




GT i „ >iA. , CI' ^ . 


CCACG77C 


'" c 


CTT7CA7GAA 


'jA ; -tkaatta r 


AA IGAA TC CC 


:ctt^aco: : 


acca:tgtga 


atggcgga 


A^ 


^rprj-i^ "** *~GTCT 


ATTCCAACG7 


CACCAGCTTT 


T AAA 7 A. TG AT 


AC AAACA-A T C 


-M.GAA - w ^ ~ 


*, ~- 


7A-ATATTTAT 


GG CGTCTCTG 


AAGTGTCGTT 


ta:tattcct 


AAGGAG PAT " 


aaagcct; 


-:A 


C ATT TGACCA 


AGTACGTTT7 


ATACAGGACA 


ICACTACGCAA 


- A 1 ' Aj l j - 


CAACGGTC 




7GC C.AACGTT 


GGGGGCAAAG 


TGACGAACTA 


7GTGGG "GCG 


AATG CGCACA 


CGGAA7TC 


GA 


gtta.a:caat 


^aAAAAATG^ 


CCAA7AAGC7 


GA:C777GGT 


C T AAAAA j A 


:gttcaaa 


7 A 


TAC C 37AGCT 


A C GG C AC C AG 


GAGGCGTTAC 


CTATGCGCTG 


ACCTATTT77 






:gg cggtcca 


ACTAGTTCGC 


A C C AAAG A C G 


AGGAACAGCG 


GGT " "*TGTC~ 


ATTATTAC 


z n r 


C AAAGCGG 


CGTGTCACGG 


AAAAA TTT" G A 


GAATCCCGCA 


GGCGGGGC CA 


TTCCTGCG 




AGAAGGTTAT 


ACGCAGGATA 


AGAAAACCAT 


TGTAACACGG 


GAGGATTT7A 


:ttttac: 


A 


AGA.^^GCACC 


ttg:ctgaac 


GTTACACAGG 


TAG7GA7GGG 


AAGACGTAT7 


TATTTAAA 


G ^ 


TTGC7ACAAA 


GGGAATGCGA 


AACC TAGCAC 


GTTGGAAACC 


ACCAAAA.CG T 


CTAGTTAC 


■ r 


GC^TCACCTAT 


GATGACAATG 


ACGA7TTGCA 


tgtggt:tat 


GAAGAAG CAG 


TGATGAAA 


A " 


:tatacgttg 


CC AGO GAG AG 


AAGC77TGTT 


CGGCTATGTT 


GATGAGCAAG 


GAAACTTC 


A 7 


taat:ccgcc 


AAGTTTAAGG 


7AAG7GCGAC 


CATGGGTGAA 


AGTGACGGA ^ 


ccacagg: 


GA 


/l-AT^ACGACT 


TTTCCCACAA 


7TGA7GGAA7 


CGATA7GCCA 


GCAAGTCAAT 


TAAAGAAA 


77 


AGCCATCCCG 


CAAAAAGTCT 


ACACACGCCC 


AGAC CATGGG 


ACAATGCTAA 


CTTATGCC 


c r 


g:aagaagtg 


AGTGTTGAAA 


TTCC7AAG7A 


TTACCAGACG 


ATTTCGATTT 


CACCAACT 


AT 


TGCGTATACA 


GGGGATAAAA 


CCAAGTATCC 


AGTACCAAAT 


GAAGTGCGC C 


GTGGCATC 




AAACCCCGAC 


AACATTGTTA 


GTAGTTTAG7 


GGGAANCXCT 


GCGTA7AAC7 


tgacccaa 




A„AGTGCCACA 


CGCTATACTG 


CCCGCCGTTC 


TTACTGGANG 


TGGGGCCCCA 


cgaaga: a 


CT 


TTACTCAATG 


AGTATCTATT 


CAGGAACTGC 


TGGGGGCAAC 


TATAATTTA 7 






tgg:accatt 


TATTATTAC7 


TAGAAAATCG 


GCGGGTCACT 


GAACATTTTG 


TAGACGAA 


AG 


TGGCGCAAAA 


ATCACGCCAC 


CAACTGGC77 


TAG AC AAGG A 


aatcag:tag 


tggtgga: 


AG 


TG.A.\AACTAT 


GTC7ACACTG 


TCGCAAAAGC 


TTT3CCGAAG 


ATCTACCAAG 


CTGGTGAA 


.AA 


a.ag:tatatc 


TTCCAAGGCT 


GGTTTAAAGG 


C AAAA ~ '_/\A<j3 


CCAGCAACA7 


T AAAG AC G 


AC 


A.ACGACCCCA 


AGTTTTACAC 


CAAC7TTTAA 


TGATGAGGAC 


GACATGACC Z 


CTGTGTA1" 


GA 


A 1 Z AAG C G ATT 


CCCACCGCGG 


AACTAACGTT 


AACAGGTGCC 


GTTGACATAA 


TCGAAAA7 


GG 


CGCCACAATG 


GATTACTGGG 


AGGCGCTACT 


GAAGAACACA 


ggcgaagcg: 


CGTTAAC C 


A Z 


CAT7AAAATC 


AAGCCAACGG 


CAAC7TGGGC 


GGC7GGCATG 


GGCGCACCCA 


ACACGATA 




TG TAC AAGG A 


ACGGGTCAAA 


ACACCAAAGC 


TTTTCCTGTC 


ACCAAAGAAC 


AATGGACG 


A — 


CGGTGCAGGA 


GTGTCCATCA 


CGTTGGATCA 


GCCTTTACCA 


GCTGGCGGT- 


AATTAAAA 


AT 


GAACTTATTA 


GGAACCGCCG 


TTACAGGAAA 


TCCTGGTCAA 


GTTTTAACCG 


CTGATGTT 


GA 


AGTAACGGGC 


AACTTTGGCA 


GTTTAACTGC 


CAAAGATACG 


GTCCGTAT7A 


AAGACTTA 


GA 


T C AAGAAATT 


ACGAGTCCTG 


ACGGCGACGG 


CTTTATTAGT 


ACCCCGACA7 


TTGATTTC 


GG 


TAAACTAGCA 


ATTTCAGGAA 


GTAAGCAACA 


ATATGGTTTG 


AAGAAGGCC Z 


C AG ATT AC 


7 A 


CGGCAATGGC 


ACTCGCAACC 


CTTATTTACG 


CCTGAATACT 


AGCCAAGCCA 


ATTGGAG 7 


I: 


AACGGCCCAG 


CTATCGCAAC 


CAAAATCAGC 


CACAGACAGC 


1 ICsC'-AA^ma 






1 i' 1AGGA 


ACGGCCGCTG 


CTGCCAGCTT 


TACCGATTAC 


AACCAACCAA 


cagaaaj: c 


AG 


GACACCACTT 


GGCAAGACCA 


GCACCGTGAC 


TTTAACCGCC 


CjA<^ AA i A^. _ j 


LAACAG 1 ^ 




GG ^C'^'-AAAC 


CAACAGTTCA 


CAGGCAGTGA 


CGTCTATCAG 


ttggacttga 


cgtttgc: 


AA 


GATCAAACTA 


GAAGTGCCAG 


CCAACCAAGG 


TATGGCTGGC 


caacaata: : 


aagccgcc 


C7 


CACGTGGAAT 


TTAGTGACTG 


GCCCCTAA 












EF107-2 <SEQ ID NO:414) 










MKRVN 














WKRWLWGLS 


CSLFMDSWG 


VTVLAETITG 


ateqgvateq 


ssdeae;- 




TTEESQATVA 


SEAKTVPPQE 


TARIASRAIG 


YSSVEGREIP 


FFFVEEDG7L 


fdpdrit:-- 




NLSTFSFYEE 
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TABLE 1. Nucleotide and Ammo Acid Scqeuences of E. faecalis G ^nes. 

KLQRTPLEPT TVNGGKLLSI PTSPAFKYDT NNQNPSNIYG VSEVSFTIPK. EYQSLDIRPS 
TFYTGDTTQY PVPTVFANVG GKVTNYVGAN AETELELTUE KMPNKLTFGP KKTFKYTVAT 
APGGVTYALT YFYGDVGGPT SSHQRRGTAG PVYYYLTKRR VTEKFEi:?AC GAIPAPEGYT 
QDKKTIVTGE DFTFTQEGTL FERYTGSDGK TYLFKGWYKG NAKPSTLETT XTPSYAVTYD 
DNDDLHWYE EAVMKTYTLP AREALFGYVD EQGNLIHPAK FKLSATHGEE DGATGEMTTF 
PTIDGIDMPA SQLKKLAIPQ KVYTRPDDGT IVTYGPQEVS VEIPKYYHTI SISPTTAYTG 
DKTKYPVPNE VRRGIENPDN IVSSLVGXXA YNLTQKSATR YTARRSYWXW GPTKTLYSMS 
IYSGTAGGNY NLSTPDGTIY YYLENRRVTE HFVDESGAKI TPPTGFT03N QLWDSENYV 
YTVAKALPKI YQAGEKTYIF C'GWFKGKTKP ATLKTTTTPS FTPTF::::EDD KTAVYQEAIP 
TAELTLTGAV DIIENGATMD YWEALLKNTG EAPLTTIK- K PTATWAAGIG APNTIFVQGT 
GQNTKAFPVT KEQWTTGAGV EITLDQPLPA GGQLKMNLLG TAVTGI iPG^V LTADVEVTGN 
FGSLTAKDTV RIKDLDQEIT SPDGDGFIST PTFDFGKLAI SGSKQ^YGLK KAADYYGNGT 
RNPYLRLNTS QANWSLTAQL SQPKSATDSL PTTTRLLLGT AAAASFTIiYi: CPTETRTPLG 
KTSTVTLTAD NTATAWANQ C'FTGSDVYQL DFTFANIKLE VPANQGI lAGv C YQAAVTWNL 
VTGP 

EF107-3 (SEQ ID NO:415) 
GG AGCAAGGAGT AGCAACATCT 

CAGTCGAGTG ACGAAGCGAG CCAGACGACG CAAACAACCG AAGAGTC/GA GGCAACGGTC 
GCTAGTGAAG CGAAAACAGT ACCGCCACAG GAAACGGCAA GAATTGC'ITC TCGAGCGATT 
GGTTATTCTT CTGTGGAAGG G C G C GAG ATT CCCTTTTTCT TTGTGGAGGA AGACGGGACG 
TTGTTTGATC CCGACCGAAT TACGATGGCG GTCAATCTTT CCACGTTTTC GTTTTATGAA 
GAGAAATTAC AACGAACCCC CCTTGAGCCC ACCACTGTGA ATGGCGGAAA GTTACTGTCT 
ATTCCAACGT CACCAGCTTT TAAATATGAT ACAAATAACC AGAATC GAAG TAATATTTAT 
GGCGTTTCTG AAGTGTCGTT TACTATTCCT AAGGAGTATC AAAGC ::TGGA CATTCGACCA 
AGTACGTTTT ATACAGGAGA CACTACGCAA TATCCAGTGC CAACGGTAr: 1 TGCGAACGTT 
GGGGGCAAAG TGACGAACTA TGTGGGCGCC AATGCGGAGA CGGAATTAGA GTTAACCAAT 
GAAAAAATGC CCAATAAGCT GACGTTTGGT CCTAAAAAGA CGTTTAAATA TACGGTAGCT 
ACGGCACCAG GAGGGGTTAC GTATGCGCTG ACCTATTTTT ATGGAGAGGT CGGCGGTCCA 
ACTAGTTCGC ACCAAAGACG AGGAACAGCG GGTCCTGTGT ATTATTA TTT AACAAAGCGG 
CGTGTCACGG AAAAATTTGA GAATCCCGCA GGCGGGGCGA TTCCTGCGGC AGAAGGTTAT 
ACGCAGGATA AGAAAACCAT TGTAACAGGG GAGGATTTTA CTTTTACCCA AGAAGGCACC 
TTGCCTGAAC GTTACACAGG CAGTGATGGG AAGACGTATT TATTTAAAGG TTGGTACAAA 
GGGAATGCGA AACCTAGCAC GTTGGAAACC ACCAAAACGC CTAGTTAT3C G3TGACCTAT 
GATGACAATG ACGATTTGCA TGTGGTCTAT GAAGAAGCAG TGATGAAAA1' CTATACGTTG 
CCAGCGAGAG AAGCTTTGTT CGGCTATGTT GATGAGCAAG GAAACTTGAT TAATCCCGCC 
AAGTTTAAGC TAAGTGCGAC CATGGGTGAA AGTGACGGAG CCACAGGG 3 A AATGACGACT 
TTTCCCACAA TTGATGGAAT CGATATGCCA GCAAGTCAAT TAAAGAAATT AGCCATCCCG 
CAAAAAGTCT ACACACGCCC AGACGATGGG ACAATCGTAA CTTATGGC 3C GCAAGAAGTG 
AGTGTTGAAA TTCCTAAGTA TTACCAGACG ATTTCGATTT CACCAAC r AAC TGCGTATACA 
GGGGATAAAA CCAAGTATCC AGTACCAAAT GAAGTGCGCC GTGGCATG 3A AAACCCCGAC 
AACATTGTTA GTAGTTTAGT GGGAANCNCT GCGTATAACT TGACCCA-AAA AAGTGCCACA 
CGCTATACTG CCCGCCGTTC TTACTGGANG TGGGGCCCCA CGAAGAAA :T TTACTCAATG 
AG TAT CT ATT CAGGAACTGC TGGGGGCAAC TATAATTTAT CGACCCCT3A TGGCACCATT 
TATTATTACT TAGAAAATCG GCGGGTCACT GAACATTTTG TAGACGAAAG TGGCGCAAAA 
ATCACGCCAC CAACTGGCTT TACACAAGGA AATCAGCTAG TGGTGGAGAG TGAAAACTAT 
GTCTACACTG TCGCAAAAGC TTTGCCGAAG ATCTACCAAG CTGGTGA-AAA AACCTATATC 
TTCCAAGGCT GGTTTAAAGG CAAAACCAAG CCAGCAACAT TAAAGACGA" AACGACCCCA 
AGTTTTACAC CAACTTTTAA TGATGAGGAC GACATGACCG CTGTGTACGA AGAAGCGATT 
CCCACCGCGG AACTAACGTT AACAGGTGCC GTTGACATAA TCGAAAATVGG CGCCACAATG 
GATTACTGGG AGGCGCTACT GAAGAACACA GGCGAAGCGC CGTTAA:CAC CATTAAAATC 
AAGCCAACGG CAAGTTGGGC GGCTGGCATC GGCGCACCGA ACACGATATT TGTACAAGGA 
ACGGGTCAAA ACACCAAAGC TTTTCCTGTC ACCAAAGAAC AATGGACGAC CGGTGCAGGA 
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TABLE 1. Nucleotide and Ammo Acid Scqeucncc; cf 



taccahs 7 ,-ncs. 



AAT7AA7 / A7 ...AAA;. 77ATTA 




A.TTTGAGGAA GTAAGCA^ATA z-^TA^ G---T7TG AAG. 



cgggaatggc 



act:3:aa:: cttatttacg gctgaatait agcgaagc::., attggac" 77 aagggcccag 
ctatcgcaac caaaatcagt 3 ac ag ac ag r ttgccaa7aa cgaccc : " 77 c77gctagga 



GGCAAGACCA GCAC IGTGAI TTTAACCGCC GACAATACCG CAACAGC ACT GG TAG C AAA C 

CAACAGTTCA CAGG:AGTGA CGT:7ATCAG TTGCACTTCA cgtttgcaaa ga.tcaaacta 

GAAGTGCCAG GGAACCAAG3 TATGGCTGGC CAACAATAC : AAGCCCCACC CACG7GGAAT 
TTAGTGACTG GCCCTT 

EF107-4 (SEQ ID NO : 4 I 6 ) 



EQGVATSQ SSDEAS^TTQ TTEESQATVA 



SEAKTVPPCE 


TARIASRAIG 


YSSVECREIP 


FFFVEEDGT 




fdpdrig; 




ALATFSFYEE 


KLQFTPLEPT 


TVTOGGKLLS I 


PTSPAFKYDT 


NNQNPf NIY 




VSEVSFT" 


FK 


YOSLDIRPS 


TFYTGDTTQY 


PVFTVFANVG 


GKVTt IYYG AN 


aetelelt:: 




KMPNKL7: 


GP 


Y KTFKYTVAT 


APGGVTYALT 


YFYGDVGGPT 


SSHQRRGTAG 


PVYYYLTKE 




vtekfei;: 




GAI FAPEGYT 


QDKKTIVTGE 


DFTFTQEGTL 


PERYT3SDGK 


TYLFKGVA'?; 




hakpstr: 




r.TPSYAVTYD 


DNDDLHWYE 


EAVMKTYTLP 


AREALFGYVD 


EQGNL3KPA 




FKLSACT: 


■ES 


ogatgemttf 


PTIDGIDMPA 


SQLKKLAIPQ 


KVYTPPDDGT 


IVTYGPQEV 




VEIPKYY. 




A ISPTTAYTG 


DKTKYPVPNE 


VRRGIENPDN 


IVSSLVGXXA 


YNLTQKSAT 




YTARRS7'. 


"\W 


:<ptktlysms 


IYSGTAGGNY 


NLSTPDGTIY 


YYLEt.'RRVTE 


HFVDESGAK 




TPPTGFT" 




7EVVDSENYV 


YTVAKALPKI 


yqagektyif 


qgv;fk UKTKP 


ATLKTTTTP 


c 


ftptfne: 


~G'"^ 


xtayyqeaip 


TAELTLTGAV 


DIIENGATMD 


YV* 7 EA1 GKNTG 


eapiattik: 




PTATWAA. 




APXTIFVQGT 


GQNTKAFPVT 


KEC'WTTGAGV 


site: ;>plpa 


GGQLKMIILIj 




TAVTGNPv 




LTAGVEVTGN 


FGSLTAKDTV 


FIKDLDQEIT 


S P DG ""jFIST 


ptfdfgkla 




SGSKQCV, 




7AACYYGNGT 


RNPYLRLNTS 


QAI'JVJSLTAQL 


sqpkcatdsl 


ptttrlllg 


T 


AAAASFC: 


- i i v 


C'PTETRTPLG 


KTSTVTLTAD 


N T ATA VVAK Q 


QFTGb JVYQE 


dftfanikl 




VPANQG17 




C.'YQAAVTWNL 



VTGP 

EF108-1 (SEQ ID NO:417) 

TAATCGGTTT GGCGGGAATC GTACATAGAA AGAAGGGAC 2 . ACATGAAGCA AA C T AAGTGG 
CAACGATTAG CAACCATTGG CTTGTGTAGT TCTTTAGTAA TTAACGC CTT TTCTGGTGTG 
ACGGCAGTTG CGGAAACCGT GACGATTGAA AGTAGTCCC-A CCGCCGAAAG TAGTGCCAAG 
GAAGAGACGC AAGCAAGTAG CGTGAAGGAA GAAACAACGA AAGCCAC7AC GGAAAATAGT 
CAAGTAACAA CTGACACGAG TCAGGAAGAA GC AACGAAA 3 AAGCGGA AAA AGAAGAACCG 
CAAGCAGAAG TGGAACAAGC AGAAACACCA ATCATTCCTA AACCAAA AAA AATCAATATG 
AAGGCAACTT ATTCATTTTC TGCAGAAACT TATCAGTTTC GATTTG7 . AA TGAATCAGGT 
CAATTAATAA ATCCAGATAT TATACCAATT ACGTATAG AC ATGCCAA-.G G ATCATGGAAG 
ACAGATGGTT ATAATGGAAA GTGGACTAGT ATGGTTCAAG GGAGTG TITC AACCGTAGGA 
AACTTAAAGA ATG TAATAAT GCCAGCAACT TCTGTAGTTA TGCCAC AAGG AGCGTCATAT 
GAAGGAACTC AAGAGGTGTA CACAAACTTT TCAATTCGGA TAG C AAA. ATA TTATGCATCA 
GCGAGTCTCT ACAATAGAGA AGGTAAAATT GATTCTACTT ATCCGTTACC TGCTATTGCA 
CTAGCAGGTA CTAGACCGCT ATCTTTGACT CAAAGTAGTG TAATTAG7GC ATTGGCGCTG 
ACCAGTAAAG GAGACAATGT TTATACACCA CGGGAAACAT TTTTTGGA3G AGATCCTGCA 
GGTGTAAAGT TTACTAATTT TTTGTATCGT ATAAATGACT TTGATGTGAA A G G T AATAAC 
ATAGGTTATA AGACTGTGAG TAGCCCAATC TATTACCATC TGACCAACCG GCGTGTCACC 
GAAAACTTCG TAGATACAAG TGGCG TCAAA ATCACGCCAC CAAGTAA7TT CACCCAAGGG 
AAACAAACGG TCATTAACAG TGAT" "TTAC ACGTTCCAAC AAAGTGG77T 77TACCCGAG 
ACCTACAAAG TTGGCACGAA AT:T7A^CGA TTCAAAGGCT GGTACA.A-.GG GAAAACCAAA 



ACGGCCGCTG CTGCCAGCT7 TAG GG AIT A 



GACAC^ACTT 
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TABLE 1. Nucleotide and Amino Acid Seqeuences of E. jaecalis Genes. 

ACCGAGCCTT TGGCCACCAC TAAAACACCT AG C TAT AAA G TCACGTACGA TGACAATGAT 
GATTTGACGG TGGTCTATGA GGAGTTTTCA GGGTACGAGC TGCCTGCTTC GACCAATCAA 
TTTGGCTTTG TGGATGAAGC GACGAACAAA TTAATTGCCC CCGACCAA3T GCAGATGAAG 
TATAATCTTA CTTTAAATGA AAATAATAAA AAAACAGTAA TGAGCAGCAA CTTAACGGGG 
ACAGATACAG CGACACTGAA AAACTT 7TCC GTGCCTGTC A ACTATTT'V 3 A ACAATATCGC 
GTCAATACGT TTTATGGCGC GAGTGACATT ACGTTTACAT TGCCCAAA CG GTACAAATCA 
ATCAATATTA CCAAATCAGA TGGCAAAACC GACCCAGCTT TTCCTCTCCC TAAAATCTAT 
AATATAGATC AAGTAGAAAT GTCAC ACATG CCTGTGACCA CTTATAA AAA GTT3AAACAG 
CTGTCGGGCC AAACGTTTGG CTTTAATGCT TTAGCCGATC AACCTGAC.CT TTATACGAAA 
ACGTTATTTG GGACAGAGTC TGGCATCGAT GACCCAGT3A ATTATTAA AC AATGAGTGGC 
CCTGTTTACT ATTATTTAGA AAACCGCAAA GTCACCGAGA ACTTCGTAGA CAICAACGGC 
GCTAAAATCA CACCGCCAAC AGGTTTCACC CAAGGTAAAA AAACGGTGAT TACAAGCGAC 
GCCTACACTT TCAAACAAGC AGGCACCTTA CCAGACACTT ACACAACAGG CGGTAAGACC 
TACAAGTTCA AAGGTTGGTA CAAAGGCAAG TCCATACTCA AC AC ATT,' AT AACTACCAAA 
GCGCCAAGTT ATCAAGTGAC CTACGATGAC AATGATGATT TGAATGT3GT GTATGAAGAA 
GAAACAGTTA CGACAGTGTA TCCATCAGTC GATATGAACT TTGTGAATGA AAAAGGCGGG 
GCTTTCACAC CGGCGTTAAC TTTTAGTGGT AAGTACTATG CGCAAAG.A:: GAGTGCGTAC 
TTAAGAACCG ATTTATATGA CGTGACCTCA AAAAATAATG GTAATGGC GA ATATACGGTA 
AGTATTAATA ATGGTAGTAT GCCATTGTCC CAAGAATTAT TGAAAAAC rA TAATAATGGA 
CAACCAATCA GTGCTACCAA CAGATTACAG TTTAATGTTG ATAAATT7 3 3 CATCGACCAA 
CAACTAAAAT ATGTTGACAG CATTCAATTA GACACAGCCC AAAGTAGCAA TT?3AAATCC 
TATAGATATG TGTACACGAA CAATAGCTCA CTGGTTTTCG ACCCAAA: GT AGCACCAGCA 
GAGGTTGACC TTAGTTCAGA ATCTCTTAAC TTGCTTAATT TTGATTCA. GA TG3CACCTAT 
TTTTCTAATG CAAATAATAG ACTTTTTTAC ACGCATTTAG GATATAC r : 33 CACACCAGGA 
GTTAACTATC TTCTCGTAAT GTTTCTTTTT AACGCCAAA 3 CTGCGGAC AA GTCAAAACTT 
GTCTACAAAG TCACTCGCAA ACAAGTCACC GAAAACTTCG TGGATGTCAA CoGTGCCAAA 
ATCACTGCAC CAACAGGCTT CACCCAAGGT AACCAAGTAC CAATGAAC AG TAA3ACCTTC 
AAGTACACAG CGGCA AAAGC TTTACCAGCG ACGTATACTA CAGGTGGCAA AGT3TATACG 
TTCCAAGGGT GGTATAAAGG GAAAACCAAG CCAAGTACGT TGAACAAAAC AACAACTCCA 
ACGTTCAATG CGACCTTTGA TGGCAATGAC GATATGACCG CCATGTACAA GGAAGAAATA 
CCAACAGCTA GTGTCACATT AACTCGACCA AAAGAAGTGA TTGATACCAA TACCAATGTA 
ATCTGGACAA CAACGATCAC GAATACTAGC AAAGCACCCT TACAAAAC CT CACCTTGAAA 
AAAGGGCCCA ATTGGTCAGC TGGTCTGACG ATCCCGACCT TTATGGAA GT GACACCAGAA 
GGAGAAACGA CAAAATCAAT CCCAGTAAAT AGTACACTTT GGACAGA GG GGTTCCTTTA 
CCAAATGCCG TTCCTATCGG CAAAAAAGTT TCAGTTGCTT TCACAAC r :C3 CGCAACAGGG 
AAACCAAACA CTGTTTTGAA AGCAGAAGTT GTAGTATTT 3 GTGGTAT 1 * .AA AGATAGTACA 
GTGGATAACT TCGTGAGAAT TCGTCCAAAT GATCAAGAA G TAGTCACCCC AACGACCGAA 
GGCTTCATCA GTGTGCCAAC CTTCGACTTC GGCCAAGTGG GCGTTGCA3 3 AACTAAGCAA 
CAACACAGCT TGAAACAAGC CGCGGATTAC TACGGTAACG GCACACGGAA TCCGTATCTG 
CGGATTAAGA AAACGCAACC CAATTGGAGC TTAACAGCGC AACTGTC ACA ACCAAAATCA 
GCGACAGACA GCTTGCCTAC AGCGACCCGC TT ATT ATT A G GGGCGGCG CC TGTCTCTAGC 
TTTACCAATT ACAATCAACC AACCGAGTTG AAAAATACGG TCGGTACC AC GAGTGCCATT 
AGCTTAACAG CCAACAACAC AGCAACGAGT ATTATTGCCA ACAAGCAC TT CACAGGTAGT 
AATGTTTATC AGTTGGACTT CACCTTCAAT AATGTCAAAC TTGAAGT CC AGCCAATCAA 
GGTGTTAAAG GGCAACAATA CAAGG3CGCA GTTACATGGA ACCTAGT": AC AGGTCCTTAA 

EF108-2 (SEQ ID NO:418) 

MKQTKWQ RLATIGLCSS LVINAFSGVT AVAETVTIES GPTAESSAKF 

ETQASSVKEE TTKASTENSQ VTTCTSQEEA TKEAEKEEPQ AEVEQAF'.' CI I FKPKKINMK 

ATYSFSAETY QFGFVNESGQ LINPPIIPIT YSYAKGSWr'.T DGYNRKIC CM \\GSASTVGN 

LKNVIMPATS WMPPGPSYE GTQEVYTNFS IRIPKYYASA SLYNREGC I D CTYPLPAIAL 

AGTRPLSLTQ SSVISALALT SKGDNVYTPR ETFFGGDPAG VKFTNFLCRI NDFDVKGNNI 

GYKTVSSPIY YHLTNRRVTE NFVCTSGAKI TPPSNFT'IGK QTVINSD1 VT FQQSGFLPET 
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TABLE 1. Nucleotide and Ami:u" A. id Seqeuences r: 7. recalls . :nzs 



YKYG7KSYRF 


K G V'rY X G ?' TK 7 




YK'CTYPP * ! ' I 


1 TP'VY* EI" £ 3 


7EEPAS7NQF 


GFVDEATInT.L 


capEvVQXky 


nepppiccppe 


7\t:£s:;ly3T 


33ATEK3 37 


PYI I YFEQYRV 


ntfygasett 


FTLFXRYKSI 


r::-;':--;- : gf.te 


paffppf : v : 


ie^yey: ccp 


YTPYIJKLKQL 


sgqtfgf:;al 


ALv'FEFYTKT 


lfgtesg;dd 


PYCrYYTYP 3 ? 


'etyyle:: ' 


TEICFVDTNGA 


kitpptgft; 


GKKTVITSDA 


Y7FF. ~ A37L? 


^TT^G ■*' " *' 


KFKGV.Y: 33 3 


:l::tltttka 


?sy£ytyde:-: 


eeeicyyyeee 


— ^ _ . „ 


I-PIE'-CIEF 3 3 - 


FT PAL TP . ' 33 


YYAQS7SAYL 


RTELYEVTSE 


nngng^ytvs 


::;::c-.v:-:pl:"q 


E7L?3< V IP ^ " : 


p:satk: ; :.;p 


1PP7KEAIDQQ 


LKY^/DSIQI," 


TA" ££ICEK£Y 




Y F E F ICY A. 7 A ^. 


■■DL£SEY...:CL 


pppfesdgtyf 


SNATTNRLFYT 


HLGYSGTPGV 




AKPAEES?::..V 


YrVTRKT'TE 


ia—evngaki 


TAPTGFTQGN 


QVPMICSICTFK 


Y 7 AA } 3~. I . ? A T 


YTTGGKYY7F 




ETENrKTTTPT 


FNATFC3NDD 


ktakykeeip 


TAS"TE7RPK 


evidtkyty; 


wtttityys:-; 


APLQNLTLKK 


GPNWSAGLT1 


FTFMEYTPEG 


ETTFPI?VII£ 


tlwtegvp: r 


ijavpigp :Y3 


V'AFTTRATGK 


PNTVLKAEW 


VFGGIKCSTV 


DKFVF.IRPIID 


QEV7TP77E3 


FISVPTP 7FG 


QVGVAGTKQQ 


HSLKQAADYY 


GNGTRNPYLR 


iek7pp:cwsl 


TAQLSy PKP A 


YL'SLPT;. pel 


LLGAAPVSSF 


TMYNQPTELK 


NTVGTTSAIS 


lta: ATf i 


I APJKQFTG3 "J 


'/yqldfp ■ :::: 


' 33EEVPANQG 


VK G 2 Q Y KAA V 


TWNLVTGP 











EF10 8-3 (SEQ ID NO: 419) 



CG7 GACGATTGAA AGTAGTCCGA CCGCCGAAAG TAG733CAAG 



GAAGAGACGC 


AAGCAAGTAG 


CGTGAAGGAA 


GAAACAACGA 


AAGCCA3 




GGAAAATAGT 


CAAGTAACAA 


CTGACACGAG 


TCAGGAAGAA 


GCAACGAA 




AAGCGGA 


^ J G\ 


AiGAAGAACCG 


CAAGCAGAAG 


7GGAACAAGC 


A G AAA C A C C A 


ATCATTCC 


PA 


AACCAAA 


■ S'Sk 


7v.YT'3AATATG 


AAGGCAACTT 


A7TCATTTTC 


TG C t-\ G >iAA J T 


TA7CAG7T 




*ATTTG': 




' I 1 G AA T C AGG T 


CAA7TAATAA 


ATCCAGATAT 


TATAC C AATT 


AC G TATA G 




ATGCCAA 


-,GG 


AT PATGGAAG 


ACAGATGGTT 


ATAATCGAAA 


GT3GACTAGT 


ATGGTTCA- 




3GAGTG 3 




.ViCCGTAGGA 


AACTTAAAGA 


A7GTAATAAT 


G3 iagcaact 


TC7GTAG7 




TGCCAC 




A : 3GTCATAT 


GAAGGAACTC 


AAGAGGTGTA 


capaaacttt 


TCAATTCG 


3 A 


TACCAAA 


P7A 


TTATGCATCA 


GCGAGTCTCT 


ACAATAGAGA 


AG3 : 3AAAATT 


GATTCTA I 


:t 


ATCCGTC 




T 3 7PATTGCA 


CTAGCAGGTA 


CTAGACCGCT 


AT3T77GACT 


CAAAGTA' ; 


i G 


T AATT AC 




ATTGGCGCTG 


ACCAGTAAAG 


GAGACAATGT 


TTA7 A C AC C A 


CGGGAAAC 


AT 


TTTTTG' : 




^G ATCCTGCA 


GGTGTAAAGT 


T7ACTAATTT 


T77G7ATCGT 


ATAAATGA 


3t 


TTGATGP 




AGGTAATAAC 


ATAGGTTATA 


AGACTGTGAG 


TA 3 ICC AATC 


TATTACCA 




tgacca:. 




CCGTGTCACC 


GAAAACTTCG 


TAGATACAAG 


TGG3GCCAAA 


ATCACGC : 




CAAGTA3 


— ..prji 


■3ACCCAAGGG 


AAACAAACGG 


TCATTAACAG 


TGA7CCTTAC 


ACGTTC 3AAC 


AAAGTGC 




TTTACCCGAG 


AC CTAC AAAG 


TTGGCACGAA 


ATC7TACCGA 


TTCAAAG'^ 


:t 


GGTACA - 




GAAAACCAAA 


ACCGAGCCTT 


TGGCCACCAC 


TAAAACACCT 


AGCTATAA 


AG 


TCACGTA 


T 3 A 


TGACAATGAT 


GATTTGACGG 


TGGTCTATGA 


GGAGTTTTCA 


GGGTACGA 




TGCCTGC 




GACCAATCAA 


TTTGGCTTTG 


TGGATGAAGC 


GACGAACAAA 


TTAATTG : 




CCGACCA 


Y3T 


'3CAGATGAAG 


TATAATCTTA 


CTTTAAATGA 


AAATAATAAA 


AAAACAG7 


AA 


TGAGCAC 


7AA 


CTTAACGGGG 


ACAGATACAG 


CGACACTGAA 


aaacttgtcc 


GTGCCTGT 


3 A 


ACTATT': 


7 3A 


ACAATATCGC 


GTCAATACGT 


TTTATGGCGC 


GAGTGA :A7T 


acgttta;: 


3T 


TGCCCA3 




3TACAAATCA 


ATCAATATTA 


CCAAATCAGA 




GACCCAGC 


7T 


TTCCTC7 




TAAAATCTAT 


AATATAGATC 


AAGTAGAAAT 


GTCA^ "ATG 


CCTGTGAC 


:a 


CTTATAA 




GTTGAAACAG 


CTGTCGGGCC 


AAACGTTTGG 


CTTTAATGCT 


TTAGCCGA 




aacctga 


ATT 


TTATACGAAA 


ACGTTATTTG 


GGACAGAGTC 


tgg:atcgat 


GACCCAG': 


3A 


ATT ATT/ 




AAT3AGTGGC 


CCTGTTTACT 


ATTATTTAGA 


AAA 3 3>^ ".AAA 


GTCACCGA 


3 A 


ACTTCG" 


\3A 


3 AC 3AACGGC 


GCTAAAATCA 


CACCGCCAAC 


AGGT7TCACC 


CAAGGTAA 




.aaacgc: 


3 AT 


7A3AAGCGAC 


GCCTACACTT 


TCAAACAAGC 


AGGCAC 3TTA 


CCAGACAC 




ACACAA: 




3 3 37AAGACC 


TACAAGTTCA 


AAGGTTGGTA 


CAAA3G CAAG 


TCCATA3P" 


PA 


ACACATT 




AAC TACCAAA 


GCGCCAAGTT 


ATCAAGTGAC 


CTAC GATGAC 


AATGATGA 


t-t 


TGAATGP 


3GT 


3TA7GAAGAA 


GAAACAGTTA 


CGACAGTGTA 


TCCATCAGTC 


GATATGAA 


^ T 


ttgtga; 




AAAA G G C GGG 


GCTTTCACAC 


CGGCGTTAAC 


TTT7AGTGGT 


AAGTACTP 




CGCAA^.3 




GAG7GCGTAC 


TTAAGAACCG 


ATTTATATGA 


CGT3A CCTCA 


AAAAATAA 




GTAATG 


33 A 


A7A7ACGGTA 
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TABLE 1. Nucleotide and Ammo Acid Seqeuences of E. fuecalis y'-xnea. 

AGTATTAATA ATGGTAGTAT GCCATTGTCC CAAGAATTAT TGAAAA/ ATA TAATAATGGA 
CAACCAATCA GTGCTACCAA CAGATTACAG TTTAATGTTG ATAAAT': AGC CATCGACCAA 
CAACTAAAAT ATGTTGACAG CATT'AAATTA GACACAGCTC AAAGTAGCAA TCTGAAATCC 
TATAGATATG TGTACACGAA CAATAG :TCA CTGGTTTTrG ACCCAA3.7GT AG3ACCAGCA 
GAGGTTGACC TTAGTTCAGA AT37CTTAAC TTGCTTAA7T TTGATT:A3A TG3CACCTAT 
TTTTCTAATG CAAATAATAG ACTT7TTTAC ACGCATTTAG GATATA 3T3G CACACCAGGA 
GTTAACTATC TTCTCGTAAT GTTT "TTTTT AACGCCAAAC CTGCGG r '-TAA GTCAAAACTT 
GTCTACAAAG TCACTCGCAA ACAAGTCACC GAAAACTT 7G TGGATGTCAA CGGTGCCAAA 
ATCACTGCAC CAACAGGCTT CAC 3CAAGGT AACCAAGTA 2 CAATGAACAG TAACACCTTC 
AAGTACACAG CGGCAAAAGC TTTAICA3CG ACGTATAC7A CAGGTGT-CAA AGTCTATACG 
TTCCAAGGGT GGTATAAAGG G AAA AC !.'AAG CCAAGTAC 3T TGAACAAAAC AACAACTCCA 
ACGTTCAATG CGACCTTTGA TGG:AAT3AC GATATGACCG CCATGT ATAA GGAAGAAATA 
CCAACAGCTA GTGTCACATT AACTCGACCA AAAGAAGTCA TTGATAC3AA TACCAATGTA 
ATCTGGACAA CAACGATCAC G AA T A C T AG C AAAGCACCCT TACAAAATCT CACCTTGAAA 
AAAGGGCCCA ATTGGTCAGC TG3TCTGACG ATCCCGAC ;T TTATGGAAGT GACACCAGAA 
GGAGAAACGA CAAAATCAAT CC CAGTAAAT AGTACACT ;T GGACAGA333 GGTTCCTTTA 
CCAAATGCCG TTCCTATCGG CAAAAAAGTT TCAGTTGC7T TCACAACTJ3 CGCAACAGGG 
AAACCAAACA CTGTTTTGAA AGCACAAGTT GTAGTATTT3 GTGGTATTAA AG A TAG T AC A 
GTGGATAACT TCGTGAGAAT TC 3TC CAAAT GATCAAGAAG TAGTC2 : ACC AACGACCGAA 
GGCTTCATCA GTGTGCCAAC CTTCGACTTC GGCCAAGT 2G GCGTTGCAGG AACTAAGCAA 
CAACACAGCT TGAAACAAGC CGCGGA 7TAC TACGGTAACG GCACACG3AA TC CGTATCTG 
CGGATTAAGA AAACGCAACC CAATTGGAGC TTAACAGC3C AACTGT C A TA ACCAAAATCA 
GCGACAGACA GCTTGCCTAC AG : 3A:CCGC TTATTATTAG GGGCGC^CCC T-3TCTCTAGC 
TTTACCAATT ACAATCAACC AA.::GAGTTG AAAAATAC 33 TCGGTAC 3AG GAGTGCCATT 

agcttaacag ccaacaacac ag:aa:gagt attattgc :a ACAAGCAA7T CACAGGTAGT 

AATGTTTATC AGTTGGACTT CACCTPTAAT AATGTCAAAC TTGAAG r :'3CC AGCCAATCAA 
GGTGTTAAAG GGCAACAATA CAA3 3CCGCA GTTACATGGA ACCTAG7TAC AG 

EF108-4 (SEQ ID NO:420) 

VTIES SPTAESSAKE 

ETQASSVKEE TTKASTENSQ VTCTTS/EEA TKEAEKEEFQ AEVEQA'CTFI IPKPKKINMK 
ATYSFSAETY QFGFVNESGQ LINFCI3PIT YSYAKGSWKT DGYNRt'V.'TSM VQGSASTVGN 
LKNVIMPATS WMPPGPSYE GT^FVYTNFS IRIPKYYASA SLYNREGK I D STYPLPAIAL 
AGTRPLSLTQ SSVISALALT SFG DNVYTPR ETFFGGDPAG VKFTNFCYF I KDFDVKGNNI 
GYKTVSSPIY YHLTNRRVTE NFVZ7SGAKI TPPSNFTQGK QTVINF: F'VT FQQSGFLPET 
YKVGTKSYRF KGWYKGKTKT E PLAT 7 FTPS YKVTYDDIOD LTWYF.'T:G YELPASTNQF 
GFVDEATNKL IAPDQVQMKY KLTLKC^NKK TVKSSNLTGT DTATLM :CCV PVNYFEQYRV 
NTFYGASDIT FTLPKRYKSI N 3 TV £ Z. GKTD PAFPLPK3 YN IDQVEM: • HMP VTTYNKLKQL 
SGQTFGFNAL ADQPEFYTKT LFGTE£'3IDD PVNYYTMSGP VYYYLE1 1? r'.V TENFVDTNGA 
KITPPTGFTQ GKKTVITSDA YTFKCAGTLP DTYTTGGFTY KFKGWYi "CKS 3 LNTLTTTKA 
PSYQVTYDDN DDLNWYEEE TVTTVYPSVD MNFVNEKGGA FTP ALT i TGK YYAQSTSAYL 
RTDLYDVTSK NNGNGQYTVS IKNGFMPLSQ ELLKKYNIiGQ PISATi:;-.LQF NVDKLAIDQQ 
LKYVDSIQLD TAQSSNLKSY F.YVYT Nt ISSL VFDPNVAF AE VDLSSECLHL LNFDSDGTYF 
SNANNRLFYT HLGYSGTPGV NYLLYMFLFN AKPADKSKLV YKVTRKCVTE NFVDVNGAKI 
TAPTGFTQGN QVPMNSNTFK YTAAFALPAT YTTGGKVYTF QGWYKGKTKP STLNKTTTPT 
FNATFDGNDD MTAMYKEEIP TASVTL7RPK EVIDTNTirv'I WTTTITHTFK APLQNLTLKK 
GPNWSAGLTI PTFMEVTPEG ETTKSIPVNS TLWTEGVPLP NAVPIGKFVS VAFTTRATGK 
PNTVLKAEW VFGGIKDSTV DIIFVFIRPND QEWTPTTEG FISVPT! LFG QVGVAGTKQQ 
HSLKQAADYY GNGTRNPYLR IKKTQPIJWSL TAQLSQPFSA TDSLPTATRL LLGAAPVSSF 
TNYNQPTELK NTVGTTSAIS LTAI^TATSI 1ANKQFTGSN VYQLDFT'FNN VFLEVPANQG 
VKGQQYKAAV TWNLVT 



EF109-1 ( SEQ ID NO:421) 
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TABLE ]. Nucleotide arid Ai:,::". . ;J Seueuence> o; i' . \;ccalis e:ie> 



2GATTTC - TG TTACCC 3TA.T 

Aj-u-.. i bnA'- _ ^, AAG ^ .-iT^^T 

Al-^T'^'^'jA*'* ^.--rt-vh . Ton 

NA7AG7AG73 ACGTACTGAA 
TAG 




7GT7AG ^GT TT7G77GTCG 
GCTC7AG •:• : AA2TAC7GAA 
7GAGTA' ?GG TGTCACTGAA 

aaacaa or:: gagttgctca 

CAACTG': 7AA T3A7ACAGAA 
AAGTAA. : AG GAGACACCTA 



EF10S-2 (SEQ ID Nj:422"! 



MKKSVI TSSMLAVLLS 3FLVT? IGAV ALERSKGTTE E77ASETSLT 
EMNPSIINSQ EETETTGTSS 7GGG : 7;- '.'ST GEVT™ "C77G XSSDVL: 

EF109-3 ( SEC ID NO: 42 3 » 

GGAA2 GCTCTAAGGG AACTACTGA^ 
GAAACGGTGG CTTGAGAAAC A7C7' 
GAAATGAACC CAAGCATCAT AAG-.7": 
ACCTCCGATT CCACCAGTGA AGTT" 

EF1C9-4 ( SEQ ID NO: 42 4) 

ERSKGTTE ET\ T ASETSLT ERQIGV/ 7/7E EMNPS I ING > EETETTS: 
EF110-1 {SEQ ID NO:425) 



. AACG GA.GCGACAAA TGAGTA' 
. G 2 AA GA.GGAAA7.- G AAACA>: . 

:taca tgag 



/TE 

\vmrthl 



:actgaa 

GTCCTCA 



:;:::3TTEVST s 



TAAATAAAAA 


TGGATAAGGA 


GTGG _ 


ATAAT 


0 7TATGAAA 




agttct; 




a :gaaaaatt 


agtgctggtt 


TTTTCTTTCT 


GA777 


7AGTA 






CCGGTT" 




7ITGTCTGCA 


aatgcagaag 


AGTATATCGT 


rp -."rp-. 
i. _ ^, j. _J 


GGGAA 


AGTCATTGA 




GACAAAA 




ATGGTTACTG 


GACCCTGAGG 


ACAGAAGACA 


AGAAG 


TGGCA 


GATACAACC. 


G 


AAGCGC" 




T G7GTCAATC 


GGAAGAATCA 


TTTCCCCTGC 




AA "OA 


GGCTATA7T 




CTTTAGG 


AAC 


A3GCTTTGTT 


GTTGGAACGA 


ATACAATTGT 


:A7CAA7AAT 


CATGTGG 71 




AAAGTT': 




G AATG C C AAA 


GTATTAAATC 


CGAATGCCAA 


A GATG 




TGGTT7TAC 




GAGGTC .". 




TGG2AGTGCG 


ACACCATTTG 


GCAAATTCAA 


AGTGA 


7GGAT 


GTAGCTTTG 






7GC 


G GATATTGCG 


GTAGTGACTG 


TCGGCAAACA 


AAAG 


ATCGT 


CCAGATGGC 




gagagt: 




AGAAATTTTA 


ACGCCATTTG 


TTTTGAAAAA 


GT77 2 




TCAGATACC 




ATGTCA : 


AAT 


AT2AGGCTAT 


CCAGGTGAGA 


AAAACCACAC 


a :aa/; 


G jT-T 


CATGAAAAC 




atttgt: 


7AC 


AT2TAACTTT 


ACAGACTTAG 


AAAATCCATT 


ACT AT 


777AT 


GATATG jAC 




:aaccg- 




T7AATCTGGT 


TCACCAATCT 


ATAATGATCA 


GGTTG 


AAGTA 


G77GGTG7T 




attccaa 


TGG 


G 2 GG ATT AAG 


CAAACAGGAA 


ATCATGGTCA 


AAGAC 


7AAAT 


GAAGTGAA7 




ATAAC7: 


TAT 


7GTTAATCGA 


GTG/iATGAAG 


AAGAAAATAA 


AGGTT 


7ATCC 


GC7GT 2 2 LV 




:agcgt- 






EF110-2 <SEQ ID NO:426) 














MKKFSIRKIS 


AGFLFLILVT 




SLSAN 


AEEYIVPAI- 




hsrqkpg 


GLD 




PEDRRQ'EVAD 


TTEAPFASIG 


r::sf 


ASKPG 


YISLGTGF'. 




GTNTIV7 




VAESFKNAKV 


LNPNAKDDAW 


FYPGR2GSAT 




RYIDV 


AFSPKADIA 


'.. 7 


vtvgkq: 


GRP 


DGPELGEILT 


PFVLKKFESS 


DTHVTISGYP 


GEKKH 


TQWSH 


EtJDLFTSNl" 




dlenpl: 




IDTTGGQSGS 


PIYNDQVEW 


GVHSNGGIKQ 




;}RLNE 


VtJYNFIVNr 




KEEENF'7 


GSA 


VPAA 



EF110-3 (SEQ ID KO:427) 

AG AGTATATCGT TCCTGCCGAA AG! CATTCAC GACAAAAAAG ATCT 7TACTG 
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TABLE 1. Nucleotide and Amino Acid Seqeucnces of E. jnecahs Genes. 

GACCCTGAGG ACAGAAGACA AGAAGT3GCA GATACAACCG AAGCGCCTTT TGCGTCAATC 

GGAAGAATCA TTTCCCCTGC CAGTAAA:CA GGCTATATTT CTTTAGGAAC AGGCTTTGTT 

GTTGGAACCA ATACAATTGT CACCAATAAT CATGTGGCTG AAAGTTTTAA GAATGCCAAA 

GTATTAAATC CGAATGCCAA AGATGAT3CT TGGTTTTAT: CAGGTCGAGA TGGCAGTGCG 

ACACCATTTG GCAAATTCAA AGTC-AATGAT GTAGCTTTTT CCCCGAATGC GGATATTGCG 

GTAGTGACTG TCGGCAAACA AAACGATCGT CCAGATGGC Z CAGAGTTGGG A 2 AAATTTTA 

ACGCCATTTG TTTTGAAAAA GTTTGAATCT TCAGATACCC ATGTCA^AAT ATCAGGCTAT 

CCAGGTGAGA AAAACCACAC ACAATAGTCT CATGAAAATG ATTTGTTTAC ATCTAACTTT 

ACAGACTTAG AAAATC C ATT ACTATTTTAT GATATCGATA CAACCGGCGG TCAATCTGGT 

TCACCAATCT ATAATGATCA GGTTGAAGTA GTTGGT 3TTA ATTCCAATGG CGGCATTAAG 

CAAACAGGAA ATCATGGTCA AAGACTAAAT GAAGTGAATT ATAACTTTAT TGTTAATCGA 
GTGAATGAAG AAGAAAATAA ACGTTTATCC GCTGTGCCA:; CAGCGT 

EF110-4 <SEQ ID NO:428) 

EYIVPAES HSRQKRSLLD 

PEDRRQEVAD TTEAPFASIG RIISPA. f ;KPG YISLGTGFVv GTNTIVTIJNH VAESFKNAKV 
LNPNAKDDAW FYPGRDGSAT PFGKFKVIDV AFSPNAD I A\ r VTVGKC'TORP DGPELGEILT 
PFVLKKFESS DTHVTISGYP GEKNHTCWSH ENDLFTSMFT DLENPLLFYD IDTTGGQSGS 
PIYNDQVEW GVHSNGGIKQ TGNHGQHLNE VNYNFIVMPv NEEENKRLSA VPAA 

EF111-1 (SEQ ID NO:429) 

TGATCAATAC ACTTCGATAC GGTCGCTTTT TTTCTAGAG A AAGTTGAATC TTTCAATAAT 
AAAAAGGGAT ACACTCCATT TGGCATAGTC CTTGCTGATA ATAAATCAGT GTATAAAGCG 
CTATCATTTT ATAGGAGGGG TTTTATGAAG GGTTTATCAA AAAAGAAACG GGTGTCTACT 
TGGTTAGCGT TAGGAATCAC CGT AG TC AG C TGTTTTGCCT TAAGCAGGGA AGTGCAAGCA 
AGTGTTGAAA GAACAAAAGT TGATGAATTT GCAAATGTTT TAGATGTGAG TGCATCACCA 
ACCGAACGGA CGAATGGCGT ATACGATACC AATTATTTTA ATAATTTTTC TGATTTAGGT 
GCATGGCATG GCTACTATTT AC CTG AAAAA AGCAATAAAG AGCTACTGGG TGGTTTTGCG 
GGGCCATTGA TTATTGCGGA AGAATATCCA GTAAACTTC'G CGGCAAGTTT AAACAAATTA 
ACGGTCAAAA ATAAAAAAAC GGG AG AAAC C TATGATTTAA GCCAAAG 2AA CCGCATGGAC 
CTGTCTTATT ATCCTGGGCG CCTAGAGCAA ACCTATGAAT TAGACGATTT AACGATTCAT 
TTAGCTTTAA TTTTTG TC AG CAATCGAACG GCGCTTATCC AAACGAC ACT TGAAAACACT 
GGTGAAGAGC CCTTGTCACT TGGAGCAAGC TGGACAG 3JTG CGGTCTTTGA CAAAATTCAA 
GAGGGAACGG AAAC CTTAG A TATTGGCACT CGTTTAACTG CTAAAGACAA TG AC ATTCAA 
GTGAATTTTG GTGAAGTCAG AGAAACGTGG AATTATTTTG CTACGAAAGA CACAAAATAT 
ACGATTCATC ATGCGGATAA AGTTTCAACA AAAATTG AT' A ATCGGAATTA TACAGCAACC 
GCTGAACCAA TTGAATTGAA GCCTAAACAA ACGTACAACA CCTATACGAC AGAAAGCTAT 
ACTTTTACAA AAGAAGAAGA GGCAAAGGAA CAACAACA/,^ CACCCGAATA TACCAAAAAT 
GCGGCGCGCT ATTTCAAAGA GAACAA3CAA AGATGGCAA o GATATCTAGA TAAAACGTTT 
GATCAAAAGA AAACAGCAGA ATTTCCTGAA TATCAAAAV3 CGCTAGTCAA ATCGATTGAA 
ACGATTAATA CCAATTGGCG AAGTGGGGCA GGTGCCTT^A AGCATGACGG GATTGTTCCG 
TCCATGTCTT ATAAATGGTT TATTGGTATG TGGGCTTGGG ATTCGTGGAA AGCGGATGTA 
GCAACGGCTG ATTTTAATCC TGAGTTAGCT AAAAATAA'A TGCGGGACTT GTTTGATTAT 
CAAATTCAAA AAGATGATAC CGTACGTCCA CAAGATGCAG GAGCGAT TAT TGATGCTGTC 
TTTTACAATC AAGACAGTGC GCGTGGTGGT GAAGGTGGCA ACTGGAATGA ACGAAATTCT 
AAAC C AC CAT TGGCTGCATG GGCAGTTTGG CATATTTATC AAGAAACCAA AGATAAGGAA 
TTTTTAAAAG AAATGTATCC CAAACTTGTG GCTTATCATA ATTGGTGGTA TACCAACAGA 
GACCACAATA AAAATGGGAT AGCAGAATAT GGAAGCATGG TCAGTGATGC TCACTGGCAA 
AAAGACGACA AGGATCAAAT C ATT AAA GAT AAAAATGGCC A:CTAAA3TG GATGATGATG 
CTGTTATTGA AGCAGCCGCG TGGGAAAGTG GCATGGATAA C jCTACACGG TTTGACAAAG 
AAGGTGTGGG CAAAGGCGAC GTTGGAGTTA AAG TTTTTG A AAA C AAA. AA T AAAGGAAAAG 
TAG 
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TABLE 1. Nucleotide and Amino A.;,-; Seqeuenc j> .-. t 'enes 



MK G E S K KK R V S TW 



eaegityysg 


FALSREVQAS 


VER7K* 


"3 ERA 


I r." E EEJE.--. ; ' • 


ektegyy 


GTE 


EEEIJFYDLGA 


WHGYYEPEKS 




pli :a: 


3EYP7 


eeaaeeeee e 


J'EITEXTE 


ETY 


EEE eg: JRMDL 


SYYPGREEQT 


YELECLTIKE 


ALIFV. 


/eeta 


l; etteeee ; 


E E P E 3 E 3 


aev: 


TGAYFDKIQE 


GTETEGGGTR 


LTAKDNCTjV 


NFGEE" 


3 E7V.EJ 


Y E AT: r I- 7 E ' E 


:h:-;ajK3 


EYE. 


;ej]rit/tata 


EPIEEKPEQT 


YNTYTTESYT 


FTEEE- 


v v r 

----- 


£ ~A?EYEYEEA. 


AERY FX EE 


.\ ^r. 


' JE GYLEKTFD 


QKKTAEFPEY 


<; neaeyke:et 




vSAAG 


aeeheeee . : 


jxv;^: 


JIG. 1 


;j.-;eev:kadva 


TADFNPEEAX 


eetkralfeyq 


ICEEE' 


— p ^ 

- ■ j - * it 


gaga: :~a" t 




JG' 7 


G-j=:^;nERNSK 


PPLAAWAVWH 


I YQETKDKEF 


LXEMY 


-KLVA 


yhee-.eyte;- e 


HEXEGIA 


rr"/.;*; 


e:e j sdahwqk 


EEKDGGIKDX 


EGKIE^'.EMMML 


L1K 


EEYA 


\\ I TEGGE7 AY 


■/v.akat: 


EEE. 


EEKTKIKEK 


EElli-3 (SEC 13 : 43 1 


) 












TP 21 TP L £ H^TT 


L- v_ j^j-^j-: A V? 1 i A. 


TAG AT; 


7TGAG 


TGGATGAG3A 








ACCGAACGGA 


GGAATGGG 3T 


ATA 3G. 


ETAGC 


AATTATY7TA 


Y'.AATY'T 


'PTE 


TGATTTAGGT 


GCATGGCATG 


GCTACTATTT 


ACCTG. 


YAAAA 


AGO A a - T A E -. • 


AG 3TAGE 


GGG 


'"GGTTTTGCG 


GGG 3GATTGA 


TTATTGCG3A 


AGAA7. 


ete :a 


GTAAAG J'JE' : 


3GAAG 


TTE" 1 


AY^AGAAATTA 


ACGGTCAAAA 


ATAAAAAAAC 


GGG AG. 


aaa eg 


TATGATTTAA 


GG 3 AY A 3 


— ». * 


7 3GGATGGAC 


CTGTCTTATT 


ATCCTGGGCG 


cjtag. 


EG GAA 


a:ctatga\ay 


3 A AA3-.-Y 


'3TT 


,7E 3GATTCAT 


TTAGCTTTAA 


TTTTTGTCAG 


caat: 


3 AAGG 


gjgcttatee 


A.AAGGAC 


A3T 


EEAAAACACT 


GGTGAAGAGC 


CCTTGTCACT 


TGGAG 




tjeagagge"; 


GG.3T3T 1 : 


T3A 


3 AAAATTCAA 


GAGGGAACGG 


AAACCTTAGA 


TATT .3 




C ETTTAA"' Y 


G i ,Aj-ljaG/- 


JAA 


EJA3ATTCAA 


GTGAATTTTG 


GTGAAGTCAG 


AGAAA 


3GTGG 










ACGATTCATC 


ATGCGGATAA 


AGTTT 




j-l.-v-^-i ee ■ - . _ j 


AT 3GG AA 


'ETA 


EA 3AC;CAACC 


GCTGAACCAA 


TTGAATTGAA 


GCCTA. 


aa : aa 


A3 3TACAY->3A 


GG TAT AG 


GAG 


AG.AAAGCTAT 


ACTTTTACAA 


AAG AAG AAG A 


ggcaa 


. ^ ; aj\ 


gaajaaeeee 


GA. 3G 3G3- 


ATA 


EA3GAAAAAT 


GCGGCGCGCT 


ATTTCAAAGA 


GAAGA 


AG :aa 


AGATGG 3 AAG 


GATATGT 


A 2 A 


TAAAACGTTT 




AAACAGCAGA 


attt: 


:tgaa 


TATGAAAA- - G 


GG3TAGT 


GAA 


AT 3GATTGAA 


ACGATTAATA 


CCAATTGGCG 


AAGTG 


eggca 


GGTG3GTTTA 


AGGATGA 


\_ jj'j 


GATTGTTCCG 


TCCATGTCTT 


ATAAATGGTT 


TATTG 


3TATG 


TGGG 3TTGGG 


A TT ""G^G 


GAA 


A 3 3 3GATGTA 


GCAACGGCTG 


ATTTTAATCC 


TGAGT 


E A3GT 


AAYAATAATJ, 








CAAATTCAAA 


AAGATGATAC 


CGTAG 


3T3GA 


C. AAG AT 7 JAG 


■JAA 3GGAE 




EGATGCTGTC 


TTTTACAATC 


AAGACAGTGC 


GCGTG 


3TGGT 


G AAG G T 3 G J .~ 


ACTGGA_ ? - 




A 3GAAATTCT 


AAACCACCAT 


TGGCTGCATG 


GGCAGTTTGG 


CATATTTA/EE 


/vA -G AAA 1 




A GATAAGGAA 


TTTTTAAAAG 


AAATGTATCC 


caaa: 


TTGTG 


GGTTATGATA 


ATTGGT3 


GTA 


TA3GAACAGA 


GACCACAATA 


AAAATGGGAT 


AGCAG 


AATAT 


GGAAG 3ATGG 


T3AGTGA 




T3A3TGGCAA 


AAAGACGACA 


AGGATCAAAT 


CATTA 


AAGAT 


AAAAAEEGEEE 


A3 3TAAG 


GTG 


GAT 3ATGATG 


CTGTTATTGA 


AGCAGCCGCG 


TGGGA. 


AAGTG 


G A T G 1 j A. T / 


C<22TAZ:- 




TTTGACAAAG 


AAGGTGTGGG 


CAAAGGCGAC 


gttgg 




AAGTT 









EF111-4 (SEC 3D 130:432) 

DEFA NVLDVSASPT ERTNGVYDTH YENNFSEEGA 

whgyylpeks ij>;ellggfag pliiaeeypv keaaslmk: T Y1'[^:KTGETY eeg^snrmdl 

SYYPGRLEQT YELDDLTIHL ALIFYSNRTA LIQTTLEIJEE EE PES EG ASW T3AVFDKIQE 

GTETLDIGTR LTAKDIJDIQV NFGEVRETi^N YFATK.DTEYT 3HHADKJJ3TK lENRIIY'TATA 

EPIELKPKQT YJITYTTEEYT FTKEEEAKEQ CQAPEYTEEA AFYYFKEEK2R V.'EGYLDKTFD 

QKKTAEFPEY QNALVKSIET I NTNVCR S AAG AEKKDGIET E MSYKV7FI 3Er.\" AWDSVJKADVA 

TADFNPELAK IJ1IMRALFEYQ IQKDCTVRPQ E AGA3 ZEA'JE YT3QDSAEGGE G3IEYNERNSK 

PPLAAWAVWH IYQETKDKEF LKEMYFKEVA YH^V.E'TEEE HITKKGIAEYG SITE SDAHWQK 

DDKDQI IKDK IJGliLKVM^ML LLKQPEGKVA IJ I TEH GET EE EE AKAT 3 EE L E 
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EF117-1 (SEQ ID NO:433) 

TAATTCGATG GAGAAGGTGG TTTAGTGAAA 

GCTTTAACAG GTTGTAAATC C GG T G AAAAA 

AAGGAAACGN CACAGTCTTA NTC AG AAA C A 

GAATATATTT CTTTGAAAGG GGAGATTGTT 

AAAGGTGATC GTTTTATTTT GAAAAGTGGT 

AAGAAAAAAT TGAAGATTGG TGACGAAGTG 

AAAGGGACAT TAATTGAAAG TGAGGAGAAT 



AGATTTTCAT TTTTTTTACT aattttactt 
gaatttga7g aagaatctct tc aaaatcta 
gaattacaaa atggtgacgt tcgtttaaat 
gagagtgac a gtcgttccag tttaataaaa 
tctagtaaat atcaagtttn taatgagcaa 
acagtctacc gagaatatta cggctttttg 

CATGATTCAG CCACGAATTA G 



EF117-2 ( SEQ ID NO:434) 



VKR FSFFLLILLA LTGCKSGEKE FDEESLQNAK ETX^CCSETE LQIOGDVRLNE 
YISLKGEIVE SDSRSSLIKK GDRFILK3GS SrlYQVXMECK XKLK I GDEVT VYGEYYGFLK 
GTLIESEENH DSATN 



EF117-3 (SEQ ID NO:435) 



TG AAGAATCTCT TC AAAATC T A 
AAGGAAACGN CACAGTCTTA NTCAGAAJVCA 
GAATATATTT CTTTGAAAGG GGAGATTGTT 
AAAGGTGATC GTTTTATTTT GAAAAGTGGT 
AAGAAAAAAT TGAAGATTGG TGACGAAGTG 
AAAGGGACAT TAATTGAAAG TGAGGAGAAT 



g;jvttacaaa atggtgacgt tc;;tttaaat 
gagagtgaca gt:gttccag tttaataaaa 
tctagtaaat atcaagtttn taatgagcaa 
acagtttacg gagaatatta cggctttttg 
catgattcag ccacgaa 



EF117-4 (SEQ ID NO:436) 



EESLQNLK ETXQSXSETE LQNGDVRLNE YISLKGEIVE SDSRSSLIXX GDRFILKSGS 
SKYQVXNEQK KKLKIGDEVT VYGEYYGFLK GTLIESEENH DSATN 



EF118-1 (SEQ ID NO:437) 



TGAGGGGGAA AAAGTGTGTT 

GCTTTGTTAC TGACAGGTTG 

AATCANAACG CACAAGAATC 

TTTTCACAAG AAGATGGTGC 
AAAGACGCAT CGCTTTCTGG 

CTTAGAGATG AAATTAAAAG 
AGATAA 



AAAAAGAAAA GTGGGGATTG 
TGGCAAAAGT GCGAAAGATG 
TGGGGTTTGN GATTTCTCTA 
ACAAACGAAT C "TATGATTG 
GGAAGATTCA AGTAGATGCC 
CGATGGGAAT GGAT^CACCG 



TCGCAGGCGT TTTCTGTTCA 
AGTTCATTCA AGCAATCGGC 
TGTCAATTAG TGACATGAAA 
GGATGCTC AT CACGCAAATC 
AAAAAA G A. AA AAGCATTCAA 
ATTTCATTGG TTGGATCGTT 



EF118-2 ( SEQ ID NO:438) 



VLKRKV GIVAGVFCSA LLLTGCGKSA KDEFIQGIGK XKAQESGVXD FSMSISDMKF 
SQEDGAQTNP MIGMLITQIK DASLSGEDSS P, :QKRKSI^A RDEIKSDGN3 CTDFIGWIVR 



EF118-3 (SEQ ID NO:439) 



GAAAGATG AGTTCATTCA AGGAATCGGC 
AATCANAACG CACAAGAATC TGGGGTTTGN 
TTTTCACAAG AAGATGGTGC ACAAACGAAT 
AAAGACGCAT CGCTTTCTGG GGAAGATTCA 
CTTAGAGATG AAATTAAAAG CGATGGGAAT 
AGAT 



GATTTCTCTA TGTCAATTAG TGACATGAAA 
c:tatgattg GGATGCTCAT CACGCAAATC 
AGTAGATGCC AAAAAAGAAA AAGCATTCAA 
GGATGTACCC ATTTCATTGG TTGGATCGTT 



WO 98/50554 



PCT/IS98/08959 



TABLE ). Nucleotide and Ammo A^id Seqcuc; 



:> v icnes. 



EF116-4 (£E^ 



KCEFIQGIGN 


XNAQESGVXD 


fsksi: 


gmkf 


s ; r." ..-A . - 


:-::c:-;Er; 


^ I I" ' 


::aslsgedss 


KCQKRKSIQL 


rceiksdgng 


ctdf:: 


wivr. 










EF113-1 (SI 


IQ ID NO: 44: 














T AAA G AA T A. C 


C G A G T/iAAA T 












ATG ^AAAAGA 


A G TG ; ^ AA C GG 


AAAGGAGC * C 




"CGTG 








TTTATTGTTA 


GGCGGAGTGT 


TGC77G77AG 


TGGCTC 


•t:aa 








GTCTAGTAAA 


GTATCGACAG 


AAGAAACGAC 


AGTGAI 


'TGAA 








gaaggaatcg 


TT7ACGAGTG 


AAGCAAC TAA 


AAAAC; 


.GACA 






AG A 


- A 3ATCAT 


GTAAAACTTC 


TAGAAGCTTA 


TGGAA/ 


agcg 






TAT 


TAA I 3ATCGC 


AATGAAAAGC 


TAAAGCCCCT 


catga; 


T ^AA 








AATT jATGTT 


AAAACTGGAG 


TAGCGTTAGT 


TTCCG" 


\A 3GA 








IGATCAA 


CATGAATATG 


CTTTACTTTT 


GGATTC 


-TGAA 








AGTGTTACTT 


TTGGCTAAGG 


TGAAGAACAA 


TAAAA*: 




G AAA TG A 






TAAG lAAGAG 


TATTAG 

















EF119-2 (SEQ ID NO:442) 

VN KLPLEILLLG GVLLVSGCQS HXEENKSSKY STii 
TTKLEEPDHV KLLEAYGNAY ANFTSINDRN EXEXPL 
VTTI YKKDQK EYALLLDCEQ NGTQTF.VLLL AKVXNi- 

EF119-3 (SEQ ID NO:443) 



:et vareqskesf 
e i kkng z e'ak tg\ 

XTYIOS'A-'.GEY 



3EATKKQTE 
^VSVGK 



AGAAAACAA GTCTAGTAAA 

GTATCGACAG AAGAAACGAC AGTGA7TGAA ACAG7AGCAA GGGAACAATC GAAGGAATCG 

TTTACGAGTG AAGCAACTAA AAAAC AG AC A GAAACAAC^A AATT AG AA 2 A ACCAGATCAT 

GTAAAACTTC TAGAAGCTTA TGGAAATGCG TATGCGAACT T T A C AA G TAT TAATGATCGC 

AATGAAAAGC TAAAGCCCCT CATGACTGAA AAATGTATC A A. AAAA AA TG G AATTGATGTT 

AAAACTGGAG TAGCGTTAGT TTCCGTAGGA AAGGTTACAA IGATTTATAA AAATGATCAA 

CATGAATATG CTTTACTTTT GGATTOTGAA CAAAATGGAA CGCAGACACG AGTGTTACTT 

TTGGCTAAGG TGAAGAACAA TAAAATTTCT GAAATGAC - .'T A T AATT C AG T TAAG C AAG AG 
TAT 



EF119-4 ( SEQ ID NO:444) 

ENKSSKV STEETTVIET VAREQSKESF TSEATKKQA 
ANFTSINDRN 

EKLKPLMTEK CIKKNGIDVK TGVALVSVGK VTT I YK: 
AKVKNNKISE MTYNSVKQEY 



:KLEEPC!-".' KLLEAYGNAY 

-! EYALILGCEQ MGTQTRVLLL 



EF120-1 ( SEQ ID NO:445) 



TGAATAGGCG TGAAAAAGGG AATGT7AGCG TTTTI 
ACTGCTTGTC GGGAACCAAA A G N AAA G AAA GTAAC 
GTTGAAGAGA CGAATGAAAA AACGAGTGAA ACAAI 
AGCAGTGTCG AGTCTAACGA ATCAGTGAAA AATG-- 
AGTCAGCTAA CTGTAGCTGA TTTAGATACT ACAGC 
TTAGTTGGAA TATGGAAAAA TG G T AA A GG A GAG AC 



*G TGCTAGCGGT TTTATCATTA 
^7 CAACGGAGGC ATCCTCTAAA 

"'a agacaaacga acaagcgagc 

cgacagct3a tggaaacaat 

;a atgctggcga ttttactact 

-a tcattcatcc tgatggtagt 



WO 98/50554 



PCT/US98/08959 



TABLE 1. Nucleotide and Amino Acid Seqcuence" of /:. fin -calis Genes. 

ACAAATACCG GAGGAATGAT TACGAAGGAT TCA:C7A::7G ATGAGTCGCG ACCAATTACA 
AGCTTAAGTA TTAGGTGGGG GCCTACTGGT GCTGCCC7A7 TATTATATAA AATTGGTGTT 

EF120-2 ( SEQ ID NO:446) 

VKKGMLAF FWLAVLSLT ACREPKXKKV TASTr AYf K EETHEKTSET IDKTNEQASS 
SVESNESVKN EEPTADGNNS QLTVADLDTT AIKACDFTTY VG IWKI-1GKGE SLIIHPDGST 
NTGGMITKDS PTDESRPITS LSIRWGPTGA AL7LYK I ' 

EF120-3 (SEQ ID NO:447) 

AAGAAA GTAACCGCTT CAACGGAGGC ATCCTCTAAA 

GTTGAAGAGA CGAATGAAAA AACGAGTGAA ACAATTGATA AGACAAAC3A ACAAGCGAGC 

AGCAGTGTCG AGTGTAACGA ATCAGTGAAA AA'I 3AAGACC CGACAG:T3A TGGAAACAAT 

AGTCAGCTAA CTGTAGCTGA TTTAGATACT ACAGCGATTA ATGCTG3CGA TT TT AC TACT 

TTAGTTGGAA TATGGAAAAA TGGTAAAGGA GAGAGTTTGA 7CATTCATCC T3ATGGTAGT 

ACAAATACCG GAGGAATGAT TACGAAGGAT TCACCTA Y'.i'G ATGAGTCGCG ACCAATTACA 

AGCTTAAGTA TTAGGTGGGG GCCTACTGGT GCTGC3CTAT TATTATATAA AATTGGTGTT 

EF120-4 ( SEQ ID NO:448) 

KKV TASTEASSKV EETNEKTSET IDKTNEQASS 

SVESNESVKN EEPTADGNNS QLTVADLDTT AiriAGDFTTL VG IVJKNGKGE SLIIHPDGST 
NTGGMITKDS PTDESRPITS LSIRWGPTGA ALLLYKIGV 



EF121-1 ( SEQ ID NO:449) 

TGAAACACAA GGAGGAAATT TGTGAAAAAG 
C AT TTTTT AA TGGCTGTTGC GTTGATAGCG 
GAAACAACGA GTCAACAAAG TTCAGAAGCA 
CAAGAACCAG TCATTACACA GGAAACAACA 
ACGAGTGACA GTGTCAAGCA GTCACAAGAA 
GAAACGTCAA TCGCTGAAAA AGAAGAAACG 
ACGTCAGATG TTCATGGTCA ATTATGGAAT 
GTTGGTTTGT CCCAAGTAAG TACAGTCGTT 
ACCGTTTTAA TTGATAATGG CGACAATATT 
AATAAAGCGC CTTTAGTGAA TGAAAAGACC 
AAGTATGATG CAATGGTTTT GGGAAATCAT 
AAAATTCAAC AAGAAGCCAC TTTTCCAATC 
GGTCTTCGTT TTGTTGAAGG GACTACCACG 
CCAGATTTAA AAGTTGGGAT TATCGGCTTA 
CCTCGTGTTA CTTCGCTTAA TTTTTTACCT 
GAGTTGAAAG CTAACGATCA GGCTGACATT 
AATAGTGATC CGGCTGCCAG TGCCGACCAA 
TATATTCTGG GTCATGACCA CCTTTCTTTT 
ACTGTACCGG TAGGGGGACC GAAAGATACG 
GTTGCTAAAA ATGCCGATAA GTGGGAAGTG 
ACGAATGTTC CAGCAGATGA AGCAGTTAAG 
CGAGCGTTTA TTCAGGAGGA GATCGGCACA 
ATTAAAGGAA TTCCCGAAGC ACAATTACAA 
GTTCAAAAAG AAGTAACGGG CGCACAATTA 
AAATTACCTG CGGGGAAGAT TTCCTATGCC 
ACCTTAGTGA GTGTTCCCAT TAACGGTGAA 



TIGAGCT77A AAAAAGTGAA GTGGGGCATG 
ccaagtgt^a CTAGTACGGC ATATGCAGTA 
GTAACAAGTA CCACCGATTC AAGTAGAAAA 

ga:a:c; :.;.y aagaagcacc ;aatcaggct 
ACCA7AGCAC O'iACAGAGAC gacgaattta 

agcaygycgy aaaaaataac /vattttaggt 
tggt:ttatg aagatgataa agaactacca 
aaccaactcc gggcacaaaa cccagcaggc 

CAAGGCA7TA TTTTAACAGA TGACTTGTAT 
catc::aatga TCACCGCCAT GAATGTGATG 
GAGT7TAA7T TTGGTTTACC GTTAATCAAA 
TTGTCTGCGA ATACCTACAA TAAGGAAGAT 
AAGGAACTTG ATTTTAATCA AGATGGGCAG 
ACAATT'rCGC ACATTCCTTT GTGGGATGGC 
TT 3 AAA iAAG AAGCAGAAAA AGCAGTTACT 

attgttg c c t c 3attc atgc ggg ac aacaa 
gtaattgaaa atgtcgc^gg gattgatgcg 
accaag:aag gagcagcscc gaatggaaaa 

GGGACA---AAG TTGTCA.AAAT T3ATCTTTCA 
CAAGAAGG7A CAGCAACGAT T3TACCAACA 

g:ag:gacaa aagaatacca tgaaaaaacg 
gtaaca " y7g attttttacc aaaacaagaa 
ccaacagcga tgatttcttt aattaataac 
agtgcggc ag cgctgtttaa atacgacagt 
acgatttttg atatctacaa atacccgaat 
aacttactg a ag tatty' ag a aaaacaaggg 



WO 98/50554 



I'CT US98/08959 



21. ■ 



TABLE 1. Nucleotide and An.ino Acid Scqei. 



G7A7ATAA77 
GoTGAAC 3AA 
AGGATTG7TA 
GAA'_ CTATTA 

AA77TTGATG 
AT77CGACT7 
7G7AA.TGGGG 

33::gactag 

CAAGAAAACC 

AA77TCTCCA 

GCGGTGGGAA 

CTGAATTTTC 

GATCCGTATA 

GCAGAAACAG 

AT77CAGAAG 

TTGGTGGTTA 

ACGCTAGCAG 

CATTCGCATA 

GGTAATTATT 

ACAACAAAAT 

GCCATCGTTG 

TATATTCCAT 

GGGAATTTGA 

G ATTTTG C AG 

GTTGGGGATG 

GGCGCCCAAG 

AATGGCACAA 

CTACATGTTT 

GCTAGTGACG 

GAAATTAAAA 

ATGGCTACCA 

CGAGAAGAAG 

TTGCGGTTGT 

GGCGAACGAA 

ACGCCAAAAC 

AATAAACAAA 

TATGGCGGCT 
CTGTTCGTTG 

TAG 



ATGAC/-iTGA7 

ct:ttgat: : 

AAAAATGG TG 
CTCG7GAT33 
GCTTTGATTT 
AAG7AGGGAA 
CAA7GTTGAT 
AAGGCG'^. 3 jA 
ATCACGAGTT 
CG A77TTAT 7 

caatcgta 5a 
caa7gaaaa7 
tagaagca7t 
ctgggca77t 
aaaccct7ag 
cagccgtcga 
taaataatgt 
tgatttcag: 
atgaagcacg 
tcacattgga 
ttggtgatgc 
taactaatgg 
tc attgctgt 
ttaaagaaat 

7TT7ACTA7A 
CAAGCTCCAT 
AAGGCAAT3A 
ATCGGCAAAC 
ATGATTTCTT 
GGATTTCACT 
ATCGTGCAGG 
TTGTTTCTAT 
CAGATCCAAA 
ACCAAGCGGG 
TTTTACCTAA 
GACTTTCTTC 



AGATA-nTG 7A 

aggzgaatta 
ggtggat3:a 

TA7GGC 7 AAA 
7GA77TTG G7 
TAG7AATAGG 
AAAA7C7G A7 
AG A 27CGAAA 
GA- ^/v^G GAA 
AGG7G7AGA7 
TCAAACATAT 
AAGTGGCAAA 
TG 3GAT^GTC 
AGCAGAGCTG 
TACGAATTTT 
TGGA7AAGGA 
GATTATGTCA 
TGGC3GCATT 
GTTA77777T 
G77TGAAATG 
TGATGCTG77 
TCGTATCC AC 
AACAGGACAA 
ACAAAAG77T 
AGCTGCITGGT 
AGATTCTGTG 
AACGACGA7T 
TTCGGAAGAA 
ACCAGACCCG 
AGCAAGACAG 
AACGGGTACA 
TTCTGGCTGG 



TA f . 



tgagt 
CAA77 

acc;a'i 

GAA77 
GG77A 

ttaat 

ga':t'i 

G777A 
AAAC7 
AGCA* 
GA^'^ 
TA . AT 



.- 7 A 7 A r ,- J 



\GTG :g 
7AA7iA./' ~ 

.aw;,GT 



A7G-AGAA; :-AA 

A7GATA7~-7T 

AATATAAA 7 A 
AG7"7AA7 7'GG 

cagaaa:;. cc 
g::gtaa:aga 

A_- 7 G A 2 G 7 AA 



'-.7A7TCGT 
: -AG CAGTG 
AAGAATAT 
AAG7AGGG 
\7ATTAAG 
GGGACA 
7A777TTCAA 
A2AAGATGTC 
TGA7GTTCAT 
C777AAAGAC 

a: aaatctcc 

7 2A7 2CCATG 
TAAAGACCAA 

a::gg3ttttt 

T J7GACGACC 
TAAA GACCCG 
TA7:72AAGCT 
G 7 A7 2GTGAT 
GA77 GATGGA 

7G :t:aaaca 

A \ ". 7AAGAAA 

g: :agttaaa 

A/7 TGTCGAT 

;a g*.:gaactta 
vG ::aa7GTGCT 

JG G : 7AATTAAA 
,T T7AAGTAACC 

:a a/.aagatgag 
•.a t::tggtttc 

;t a7 7aggaata 

lA A7AATACCGG 

:t aggtggtgaa 
,g tg:aaccagc 
;a a;:attccca 
;g tggaggagag 
:■: a:;tagcaacc 
;a g.aagaaaaag 
:7 a7atggttta 
aagctagt 



EF121-2 (SEQ ID NO:450) 

VKKL SFKKVKWGMH FLMAVALIAP SV7STAYA' *E 77. A .7:7 A 7 77 77DSS7KQ 

EPVITQETTD IKQEAPNQAT SCSVKQSQET TA77E7771 7 T7 7 AFX7777G 7?7?'ITILGT 

SDVHGQLWNW SYEDDKELPV GLSQVSTWN QV/A7777, V7:0NG7777 G777TDDLYN 

KAPLVNEKTH PMITAMNVMK YCAMVLGNHE F7:777?77 7 i;7EA777:L EA7TYNKEDG 

LRFVEGTTTK ELDFNQDGQP FLKVGI IGLT I?H I ?7V;7 7 : P777SLMF7PL KEEAEKAVTE 

LKANDQADI I VASIHAGQQN SDPAASADQV IE:r7AG;77. ILGHDH177T K7 7AAPNGKT 

VPVGGPKDTG TEWKIDLSV AKNADKWEVQ EG7A77\7^'7' 17 *?ADEA':7A. A77.EYHEKTR 

AFIQEEIGTA TADFLPP'QEI KGIPEAQLQP TAI-3I3L::::7- V.77VTGA77S A.AA.LFKYDSK 

LPAGKISYAT IFDIYKYPNT LVSVPINGEN L7777LF7''; AA V7! j 77Q7 77L TISFNPNIRV 

YNYDMISGVD YKIDISKPVG ERIVDAKIDG QF77PAF.F77 IA7:K:7i77GG LASQGIQVGE 



WO 98/50554 



PCT/US98/08959 



2D 

TABLE 1. Nucleotide and Amino Acid Seqeucncc;; r: :\ fee cults Genes. 



PIKNSDPETL 


RGMIVDYIKK 


KGTLDPECEI 


erni I "-"!-; 


FT 


TKWP.A 


yjkz 


EL 


/NDGTLQI 


PTSPDGRTPN 


AAAITKQDVR 


IIAGFDLDNAY 


TIMKTLT'GGT 




ZAGKG 




MA 


rlktfkdq 


ENPTLMVDAG 


DVFQGLPI SN 


FSKGADMAKA 


mkgvgg : a v a 


VGMHEFD 


GGL 


e: 


\LGYKDQL 


NFPILSSNTY 


YKDGSGRVFD 


PYTIVEKSGK 


kfa:gg\gtg 


iti 


ATKTH 


?KL 




"VTFKDPI 


PEVEAVIKEI 


KEKYADXQAF 


WTGHLGVDE 


TTPil ivlglt 


LA 


ETLFQ 




e:. 


LTTVIDGH 


SHTAVESGKR 


YGKVIYAQTG 


IIYLNNVGIVT 


apltei t; g g 








el 


PEUPAVKA 


IVDEARTNFN 


AENEKVIVDY 


IPFTLDGQRE 


NVP.TRETCLG 


i:i 


igda: 


MLY 


Gy 


DAFSQPAD 


FAVTNGGGIR 


ADIKQGPIKV 


GDVIAVLPFG 


ns:A e : g 


AC V KEMF 


gms 


VR 


c I PQKDEN 


GTILLDDAGQ 


PKLGANGGFL 


HVSSSIRIHY 


DSGK.PGTLLA 


el: 


ZGNET 




IV 


r^SRVLGIE 


IKNRQTQKFE 


PLDEKKQYRM 


ATNDFLAAGG 


DGVLMLGGTG. 


EF 


g:sli: 


TVL 


.i :■ 


: i_jKSATSL 


RLYRAATTID 


LAQYKEPFPG 


ERIVSISEEA 


YKEL JCGGFT 


P> 


PDPK? 


G?K 


?T 


r'ETPVATN 


KQNQAGARQS 


NPSVTEKKKY 


GGFLPKTGTE 


TETLALYGLL 


F\ 


G L S S £ 


GUY 


I Y 


KRRNKAS 



EF121-3 (SEQ ID NO:451) 



ACAAAG TTCAGAAGCA GTAACAAGTA CCACCGATTC .GAGTAGaAAA 



CAAGAACCAG 


TCATTACACA 


GGAAACAACA 


GAG 




A-A 


:L\AGG 






G/PCAGGCT 


ACGAGTGACA 


GTGTCAAGCA 


GTCACAAGAA 


ACT 




CA 


ACAG-G 


^ .* \ 1 . 


g; 


.GGAATTTA 


GAAACGTCAA 


TCGCTGAAAA 


AGAAGAAACG 


AGG 




Aj l 


A/WAT 


TAG 




.TTTTAGGT 


ACGTCAGATG 


TTCATGGTCA 


ATTATGGAAT 


TG' ; 




A A 


!5 ATGATAA 


A(. 


-AACTACCA 


GTTGGTTTGT 


CCCAAGTAAG 


TACAGTCGTT 


aa: 


:aa ttgg 


GG 


g:a ca. 




c : 


^GAGCAGGC 


ACCGTTTTAA 


TTGATAATGG 


CGACAATATT 


CAA 




TT 1 


ttaag 


*V.s A 


TC 


TACTTGTAT 


AATAAAGCGC 


CTTTAGTGAA 


TGAAAAGACC 


CAG 




TG 


YCCGC 




GJ 


*^ATGTGATG 


AAGTATGATG 


CAATGGTTTT 


GGGAAATCAT 


GA ; 




TG 


;tttt 




G TT AATC AAA 


AAAATTCAAC 


AAGAAGCCAC 


TTTTCCAATC 


TT :; 




AT 








GvGGAAGAT 


GGTCTTCGTT 


TTGTTGAAGG 


GACTACCACG 


aa ; 


. p t '\J 1 - - i G 


/-i i 


tttaa. 




A^ 


ATGGGCAG 


CCAGATTTAA 


AAGTTGGGAT 


TATCGGCTTA 


AC.A 




AG 






G 1 


'GGGATGGC 


CCTCGTGTTA 


CTTCGCTTAA 




TT"5 


AAA "7 A. AG 


AA. 


d "AG A 




a.: 


; TAGTTACT 


GAGTTGAAAG 


CTAACGATCA 


GGCTGACATT 


AT n 














AATAGTGATC 


CGGCTGCCAG 


TGCCGACCAA 


GTA 


AT^ALAA 


AT 


*tgg^ 




CG 


.TTGATGCG 


TATATTCTGG 


GTCATGACCA 


CCTTTCTTTT 


AC : 


A AG :ggg 


GA 


: CAGG 




g; 


^ATGGAAAA 


ACTGTACCGG 


TAGGGGGACC 


GAAAGATACG 


GG = 




rpr-i- 


d T T AA 




T. 


AATGTTTCA 


GTTGCTAAAA 


ATGCCGATAA 


GTGGGAAGTG 


CAA 


3aa 3 :g. ; -. 




ACAAC 


.:- A I 1 


TC 


^TAGCAACA 


ACGAATGTTC 


CAGCAGATGA 


AGCAGTTAAG 


GCA 




A A 


^ m 




T, 


r/iAAAAACG 


CGAGCGTTTA 


TTCAGGAGGA 


GATCGGCACA 


gcaa:;g::tg 


AT 


TTTTT 




hs 


-JiACAAGAA 


ATTAAAGGAA 


TTCCCGAAGC 


ACAATTACAA 


CCA 




TG 


ATTTG 


TTT 


A.' 


lTTAATAAC 


GTTCAAAAAG 


AAGTAACGGG 


CGCACAATTA 


AGT 






:tgtt 


TAA 


A T A G G AC AGT 


AAA TT AC C TG 


CGGGGAAGAT 


TTCCTATGCC 


AC * 


ATTTTTG 


AT 


i-p m ^ 




ATAGCCGAAT 


ACCTTAGTGA 


GTGTTCCCAT 


TAACGGTGAA 


AA : 


TTAGTGG 


a : 


PATTT 


AG A 


A. 


^aAGAAGGG 


GCG TAG TATA 


ACCAAACACA 


GCCAGATGAT 


TT^ 




GT 


TTTAA 




A„- 


•-ACATTCGT 


GTATATAACT 


ATGACATGAT 


TTCTGGAGTG 


ga: 


TAGAAGA 


TT 


:agat 


TTT 


A_ 


-JACCAGTG 


GGTGAACGAA 


TTGTAGATGC 


GAAAATTGAC 


gg': 


:aa cggg 


t:- 


iATCC 


TG ~ 


O 


,/vAGAATAT 


ACGATTGCTA 


TGAATAATTA 


TGGTTACGGC 


ggtttagcta 


g : 


CAAGG 




T ; 


:aA'^taggg 


GAACCTATTA 


AAAATTCTGA 


TCCAGAAACC 


TT \ 




t: 


ATTGT 


TGA 


TTATATTAAG 


AAAAAAGGAA 


CTCTTGATCC 


AGAACAAGAA 


at: 




at 


TGGTC 


A AT 


T. 


-.ttgggaca 


AATTTTGATG 


AAAAATGGCG 


TGCCAAAGCA 


at: 


3AATTAG 




AATGA 




C. 


ACTTTTCAA 


ATTCCGACTT 


CTCCTGATGG 


ACGTACACCA 


aa: 















EF121-4 ( SEQ ID NO:452) 



QSSEAV TSTTDSSRKQ 

EPVITQETTD IKQEAPNQAT SDSVKQSQET TAT TE PT1G_E TGIAEKEETS TPGKITILGT 
SDVHGQLWNW EYEDDKELPV GLSQVST\AT\I QVLA.GG , ' VLIDNGDr:!^ GTILTDDLYN 



WO 98/50554 



!'( T. ITS98/08959 



TABLE 1. Nucleotide and Amino Acis Seqeu 



kapl\"KTH pkstavaatik v casta gg as e a;gaaa: g a:ty::kedg 

irfyegtttk eief:;;eg;.f cakysssggt istisa': :-' :s:.::?:m fagaekavte 

gka:;ggag:: yasikaggtg: slpaasag:'/ : :•• : . ... :: a: a sa \ . aaapngkt 

vpvggpketg 7e sat. i easy akigagkaeycj ec . at a " ' aatageagsga. /a aeyrlektr 

afiseeigta tagflpkgei kg:?ea-l;p taaaa a. ; sgatsga: as a .agfxydsk 

apagkisyat ifgiyxypat ays'.'? act?:; at -ay a v at:a.a z asficpnirv 

Y.T/DMISGVD YKICISK?'-G ERIVEAX :TG I " ! A' ' " AAAIGGGAYCS LAS ~G IQVGE 

PIKNSDPETL RGMIVAYGKH KGTAG-AT.TG STAGS:; A ^!AA • AI GGAAjGTLQI 
PTSPDGRTPN A 



EF122-1 ( SEQ ID KO:453) 

TG AAA C AC AA GGAGGAAATT TG AG AAAAAG 
CATTTTTTAA TGGCTGTTGC GTTGATAGGG 
GAAACAACGA GTSAACAAAG TTCAGAAGGA 
CAAGAACCAG TCATTACA .AA GGAAAAAAAA 
ACGAGTGACA GTGTSAAG3A G TC A C AA < 3 AA 
GAAACGTCAA TCGCTGAAAA AGAAGAAAGG 
ACGTCAGATG TTSATGGTCA ATTATGGAAT 
GTTGGTTTGT CCCAAGTAA3 TAGAGTCGTT 
ACCGTTTTAA TTGATAATGG CGASAATATT 
AATAAAGCGC CTTTAGTGAA TGAAAAGA TC 
AAGTATGATG CAATGGTTTT GG3AAATCAT 
AAAATTCAAC AAGAAGCCAG TTTTGSAATG 
GGTCTTCGTT TTGTTGAAGG GACTACCAGG 
CCAGATTTAA AAG TTGGG AT TATCGGSTTA 
CCTCGTGTTA CTTCGCTTAA TTTTTTAC ST 
GAGTTGAAAG CTAACGATSA GG ST GAG ATT 
AATAGTGATC CGGCTGCCAG TGCC SAC AAA 
TATATTCTGG GTCATGACCA CCTTT'STTTT 
ACTGTACCGG TAGGGGGAGC GAAA3ATAGS 
GTTGCTAAAA ATGCCGATAA GTGGGAAGT3 
ACGAATGTTC CAGCAGATGA AGCAGTTAAG 
CGAGCGTTTA TTCAGGAGGA GATCGGCASA 
ATTAAAGGAA TTCCCGAAGC ACAATTASAA 
GTTCAAAAAG AAGTAACGGG CGCACAATTA 
AAATTACCTG CGGGGAAGAT TTCCTAT 3 J 3 
ACCTTAGTGA GTGTTCCCAT TAACGGTGAA 
GCGTACTATA ACCAAACACA GCCAGATGAT 
GTATATAACT ATGACATGAT TTCTGGAGTG 
GGTGAACGAA TTGTAGATGC GAAAATTGAC 
ACGATTGCTA TGAATAATTA TCGTTACG3C 
GAACCTATTA AAAATTCTGA TCCAGAAASC 
AAAAAAGGAA CTCTTGATCC AGAACAAGAA 
AATTTTGATG AAAAA TGGCG TGCCAAAGSA 
ATTCCGACTT CTCCTGATGG ACGTACAGSA 
CGTAATGCGG GCTTTGATTT AGATAATGSA 
GGCCGACTAG AAGCAGGGAA AGGCGAATTA 
CAAGAAAACC CAACCTTGAT GGTGGATGGA 
AATTTCTCCA AAGGCGCGGA TAT G G C C A .AA 
GCGGTGGGAA ATCACGAGTT TGATTTTG ST 
CTGAATTTTC CGATTTTATC TAGTAATASG 
GATCCGTATA CAATCGTAGA AAAATCCGGS 



GT sag ; ; •" ; , 


AG- 


AAAG TG .AA 


G' 


: GGGGCATG 


. • AAA ■ ■ : ■ ;. 


G _ 


AGTACGG S 




AAG'GG"AGTA 


G TAAAA--. ■ ■" 




A GAG A — S 




■GTAGAAAA 










-.A SGAGGCT 






A-: SAG AAA S 




SAAAATTTA 


ag:ags;a a; 




.■^-AS/GAG 


.A. 


-TTTTAGGT 


t;aa a - : 


.-' 


A ATA AG A A 


A. 


AAACTACCA 


AASGAAAAG . ' 


C " 


CCA 3 A AAA 


G 


S SAGCAGGC 


CAT..' .A . . A 


GA 


A AAA A, AAA 


S' 


AA 7 GTGTAT 


CATS SAGA ,A 


AA 


ASCGGGAT 




-,GiGTGATG 


gagtttaa:t 


TS 


CGTTTAAC 


G' 


TTA.ATCAAA 


TT: A g A- a. 


AA 




'T 


AAG3AAGAT 


AA G G GA G G G G 




TTTAATCA 


A 


"AATGGGCAG 


ACAATTCG A: 




A TAGS ACTS 1 


G- 


TG G3ATGGC 


rpT ^ ^ ^ ; 


AA 


G !T AG. AAA A 


A 


"AAAGTTACT 


ATT ASA S S 'T 




A TTG. SAG'S 




. GASAACAA 


GT AACGAGAG. 




GTS GAG AG G 


'S 


ATTGATGCG 


acsaagsa.ag 


G/ 


G AA GG" G 3 3 


G : 


AATGGAAAA 


gg. a ;. . . . 




r^..-i- .. . 




ASTTTCA 


CAGGAAGG A 






'7 


"-SAGCAACA 












GC AA s A '- : ' " s 






^. 


'■-AASAAGAA 










ATTAATAAC 


AGTGCGGCAG 




CTGTTTAA 


A. 


TASGACAGT 


AGGATTTTTS 




ATCTAGAA 


A 


TACCCGAAT 


AASTTAGTSA 




TATTTAGA 


A- 


AAAC AAG GG 


TTG A"'" -*■ ' - 






A. 


AASATTCGT 


GAS. A. A A 








A.AACCAGTG 










AAAGAATAT 






^.•-/G S ..j -a . 




S.AAGTAGGG 


TTAGGAG A. A 








SATATTAAG 


atsgaag 








A TTGGG AC A 


at; gag-/' t as 








AGT3TTCAA 


AA.:-GAS ... ■ ■ 








SAAGATGTC 


TATA SAA • 








,AS3TTCAT 










STTAAAGAC 


GGGGATGGCG 








SGAATCTCC 


gcaatgaaa g 


A.- 


-GTTGGTTA 




AAT3CCATG 


TTASAGAGGS 




.3TAGGTTA 




AAAGACCAA 


T ATT AS. A - AS 




3 S STG-S 




AA3GTTTTT 


AAA A A AAA A.' 7 








"J TG ACGACC 
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CCAGAAACAG 


CAACGAAAAC 


ACACCCGAAA 








T3TGACATT 


CC 


.AAGACCCG 


ATTCCAGAAG 


TAGAAGCAGT 


GATTAAGCAA 


ATTAAAGA 


AA 


A 


;tacg :gga 


T.' 


ATIJCAAGCT 


TTCGTGGTTA 


CTGGGCATTT 


AGGCGTACAT 


GAA. 


acca;: 


CC 


C 


sCATATCTG 


CA 


AGTGGTGAT 


ACGCTAGCAG 


AAACCCTTAG 


TCAAACATAT 


cctgac re 


AC 


/A 


aatca::tgt 


cc 


ATTGATGGA 


CATTCGCATA 


CAGCCGTCGA 


AAGTGGCAAA 


CGT 








.GTCATCTA 




: CTCAAACA 


GGTAATTATT 


TAAATAATGT 


TGGGATCCTC 


A 'J A 




AC 


A 


AAGTGAlACC 


A / 


iCTAAGAAA 


ACAACAAAAT 


TGATTTCAGC 


AGCAGAGCTG 


CTA 




CC 


C 


vGAAAACCC 


g; 


ACAGTTAAA 


GCCATCGTTG 


ATGAAGCACG 


TACGAATTTT 


AAC 




'vA 


A 


CGAAAAAGT 


A, 


ATTGTCGAT 


TATATTCCAT 


TCACATTGGA 


TGGACAACGA 


GAA 


AATAA 


CC 


G 


AAC AC GAGA 


G. 


i CC AAC TTA 


GGGAATTTGA 


TTGGTGATGC 


GATTATGTCA 


TAT 






A 


CCCGTTTAG 




'AACCTGCT 


GATTTTGCAG 


TAACTAATGG 


TGGCGGCATT 










: haACAAGG 


G ' 


: CAATTAAA 


GTTGGGGATG 


TCATTGCTGT 


GTTACCTTTT 


gc: 


AATAA 


7 7i 


T 


AGCGC AAAT 


1; 


AAAGTAACC 


GGCGCCCAAG 


TTAAAGAAAT 


GTTTGAAATG 


T : ""T 


cat re 






AATTC CAC A 


A, 


..AAGATGAG 


AATGGCACAA 


TTTTACTAGA 


TGATGCTGGC 


CAA 


C C AAA 


AC 


T 


AGGCGCAAA 


T: 


A3TGGTTTC 


CTACATGTTT 


CAAGCTCCAT 


TGGTATC CAC 


TAT 


CATC A 






AAAAC .:ACG 


T. 


, ATCGCTTG 


GCTAGTGACG 


AAGGC AATGA 


AACAGGACAA 


AC C 


ATT A A 1 


AG 




AAGTCGCGT 


A" 


A AAGGAATA 


GAAATTAAAA 


ATCGGCAAAC 


ACAAAAGTTT 


GAA 








AG A UAAGAA 


r- 


AAATACCGG 


ATGGCTACCA 


ATGATTTCTT 


AGCTGCTGGT 


GCT 




TT 


A 


AGATATGCT 




: CTGGTGAA 


CGAGAAGAAG 


GGATTTCACT 


AGATTCTCTC 


TTA 


ATT A- 






"TTGAAAAG 


T 


^CAACCAGC 


TTGCGGTTGT 


ATCGTGCAGC 


AACGACGATT 


GATTTA A 






ATATAAAGA 


A 


ACATTCCCA 


GGCGAACGAA 


TTGTTTCTAT 


TTCGGAACAA 


GCT 


CA A- 






A TTA A ACGG 


r l 


^GAGGAGAG 


ACGCCAAAAC 


CAGATCCAAA 


ACCAGACCCG 


AAA 








AG AAA :ACC 


a 


;TAGCAACC 


AATAAACAAA 


ACCAAGCGGG 


AGCAAGACAG 


AG? 








AGTAACAGA 




AAGAAAAAG 


TATGGCGGCT 


TTTTACCTAA 


AACGGGTACA 


GAA AC A AG 






7CTTG :ATT 


A 


AATGGTTTA 


CTGTTCGTTG 


GACTTTCTTC 


TTCTGGCTGG 


TAT 


/ATT 1 A 


T/-i 




ACGAC CTAA 


C 


AAAGCTAGT 


TAG 





















EF122-2 (SEQ ID NO:454) 



VKKL SFKKVKWGMH FLMAVALIAP SVTSTAYAVE r I A A 






AY T3TTDS 




ga ;■ 


EPVITQETTD 


IKQEAPNQAT 


SDSVKQSQET 


TAP TEC T! " 




n 7 


I AEKEETC 


T 


^'QKITILGT 


SDVHGQLWNW 


SYEDDKELPV 


GLSQVSTWN 


CATaaa'pa 






CDKGGKIC 




A I LTDDLYN 


KAPLVNEKTH 


PMITAMNVMK 


YDAMVLGNHE 


FNFGLr :a: 






vEATFPIL 




ANTYNKEDG 


LRFVEGTTTK 


ELDFNQDGQP 


DLKVGI3 GLT 


I PHI P. v;J 


JP 


PA 


TCLNELPL 


K 


ceaekavte 


LKANDQADI I 


VASIHAGQQN 


SDPAASADQV 


iexvaa: a 






GP! DHLS FT 


KCGAAPNGKT 


VPVGGPKDTG 


TEWXIDLSV 


AKNADKWEVQ 


egtat: V7- 




IT. 


PAEEAVKA 




ataeyhektr 


AFIQEEIGTA 


TADFLPKQEI 


KGIPEAQLQP 


TA>:i£L 1 A: 


IT J 


CG 


EVTGAQLS 


;■, 


AALFKYDSK 


LPAGKISYAT 


IFDIYKYPNT 


LVSVPINGEN 


i.lkylkk: 






KQT: PP'DL 




ACFNPNIRV 


YNYDMISGVD 


YKIDISKPVG 


ERIVDAKIDG 


CFLdp,\ke 


', ' Tl 
1 X 


CArTNKYRYGG 


L 


ASQGIQVGE 


PIKNSDPETL 


RGMIVDYIKK 


KGTLDPEQEI 


EPNV'C: TG 




FDEKWFAKAI 


E 


L.VMDGTLQI 


PTSPDGRTPN 


AAAITKQDVR 


NAGFDLDNAY 


timhtl;c : 




RL 


AEAGFGEIjG 


K 


ARLKTFKDQ 


ENPTLMVDAG 


DVFQGLPISN 


FSKGADMAKA 


MIIEVGTCA 






T'HEFDFGL 




ZALGYKDQL 


NFPILSSNTY 


YKDGSGRVFD 


PYTIVEKSGK 


FFAIVGVA 


TP 




^ATCTHPF.H 


Y 


EKVTFKDPI 


PEVEAVIKEI 


KEKYADXQAF 


WTGHLGVDE 


TTPH J\ AC A 






.ETLSQTYP 




LCCTVIDGH 


SHTAVESGKR 


YGKVIYAQTG 


NYLNNVGTVT 


APECECTr 




T] 


"L I SAAELL 


E 


APENPAVKA 


IVDEARTNFN 


AENEKVIVDY 


IPFTLDGQRE 


nvrtret; 




I G L 


. IGDAIMSY 


C 


A A AFSQPAD 


FAVTNGGGIR 


ADIKQGPIKV 


GDVIAVLPFG 


ns: act;; ' 






acaeaeems 




AC IPQKDEN 


GTILLDDAGQ 


PKLGANGGFL 


HVSSSIRIHY 


3STAPGTI 






:eghetgqt 




7GSRVLGIE 


IKNRQTQKFE 


PLDEKKQYRM 


ATNDFLAAGG 


dgychcg: 




El 


CGISEDSVL 




EYLKSATSL 


RLYRAATTID 


LAQYKEPFPG 


ERIVSISEEA 


YKEL3 GAA 




PJ 


;PDPK?DPK 


T 


TFETPVATN 


KQNQAGARQS 


NPSVTEKKKY 


GGFLPKTGTE 


tetaacy: 




r 


■GLSSSGWY 




ykrrnkas 



EF122-3 (SEQ ID NO:455) 
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TG AAAAA T G G C G TGGCAAAC 










G G G 










ITCCTGATGG 


ACGTAGA 










. i- - ^ 




AAGATGTC 


GG ^ .mATGCGG 


GCTTTGATTT 






- - 






AA GAA^A 




AG GTTCAT 


GG C CGAG TAG 




AG'' ^ l G r-„-.T ^ j-\ 


GGT 




g . t 


• 




"GTAAAGAC 








* . - rt 


GGG 


gat. : 




\G 3 ATT 




ATAATCTCC 


AATTTCTCCA 


AAGG I GCG 3 A 


TaT^GGG. 


-_--A 


'o - - 


ATGG 


■ a. • • : 


:gagtg'a 




-ATGCCATG 


1 ^cGGTGGGAA 


ATv AG jAGT* 


.GATv r IG 


GGT 




gag- 


: ' ' ' '"■ 


AGGTGA 




.A^AGAGCAA 


GGTGAATTTTC 


GGATTTTATC 






TA r : 


' :av ' 




:agtgg 




GGGTTTTT 


GATCCGTATA 


CAATCGTAGA 


AAAA.TGG 




AAA 




ggat 


ggtagg 




-TAAGGACC 


GGAGAAACAG 


CAAC 3 AAAA C 


A -AC :cg 


"~-*A 


AAC 




• • 


G- AC ATT 




AA.GACCCG 


ATTC CAGAAG 


TAGAAGCAGT 


GATTAAG 


GAA 


ATT 


AAA;, v . ..- 


A AG - A 


AGGGGG. 




■TGGAAGCT 


TTCGTGGTTA 


CTGGGGATTT 


AGGC GTA 


gat 


GAA 




cgga 


TATCTG 




' GT jGTGAT 


ACGCTAGCAG 


AAA G C C TT AG 


^ „>v>-~rt ■.-.**-.. 


..«GT 


CG" 


2 a G'; 




aagtgt 




lTT jATGGA 


CATTCGCATA 


caggcgtcga 


AA GTGGC. 


VAA 


GGT 


tat; g 


;, AAGT 


'AATCTA 


G 


:tgaaaca 


GGTAATTATT 


TAAATAATGT 


TGGGA r "'" 




aga 


ggag a 


A GAG 


:g.aaa: 




.gtaagaaa 


ACAACAAAAT 


TGATTTCAGC 


AGCA j AG 


„ ^ 'j 


cta 


gaag ; g 


G G AAA, 


va.ac :c 


L : 


-gagttaaa 


GCCATCGTTG 


ATGAAGCAGG 


TACGAAT 


TTT 


AAC 




A ATGA. 


A.A.AAGT 


A: 


TTGTCGAT 


TATATTCCAT 


TGAGATTGGA 


TGGA -AA 


:ga 


GAA 


AATG . : 


g: AG 


\:;^aga 


G: 


gaagtta 


GGGAATTTGA 


TTGGTGATGC 


GATTATG 


pa a 


T *■* ^ 


■ J'J _ _ .■ ..— „ 




GTTTA'G 




.yaggtgct 


GATTTTGCAG 


TAACTAATGG 


TGGCGGC 


-.TT 


CGC 




; : 


\G7iAGG 




G/.ATTAAA 


GTTGGGGATG 


TCATTGCTGT 


gttaggt 


TTT 


GGG 


AATA.- G 


a t : a : 


-:.^aag 




A_A GTAACC 


GGCGCCCAAG 


TTAAAGAAAT 


GTTT3AA 


ATG 


TG r " 




C GG.T 


TGGAGA 




-.AAGATGAG 


AATGGCACAA 


TTTTAGTAGA 


TGATGCT 




GAA 






AGGA.AA 




- "'G'GGTTTC 


CTACATGTTT 


CAAGCTCCAT 


tcgtatg 


::ac 


TAG 


GAT ^ . _ 




a: :ag:; 




/GT GGCTTG 


GCTAGTGACG 


AAGGCAATGA 


AACA5GA 


:aa 


ACG 


ATTG : G 




GGGCGG 




TAGGAATA 


G AAA TT AAAA 


ATCGGCAAAC 


ACAAAA3 


-n— r-p 


GAA 






"AA.AGA.A 




GAATACCGG 


ATGGCTACCA 


ATGATTTCTT 


agctggt 




GGTGATGGT 




tatg:t 




AGTGGTG AA 


CGAGAAGAAG 


GGATTTCAGT 


agattgt 


GTC 


TTA 


lA TTG.— - 




^A.AA.^ 1 .' 




-G/vACCAGC 


TTGCGGTTGT 


ATCGTGCAGG 


aacgagg 


AT" 1 " 


GAG 






TA-AA3A 




:GATTCCCA 


GGCGAACGAA 


TTG TTCTAT 


TTGGGAA 


GAA 


GGG 






AATC G G 


G 


-GAGGAGAG 


ACGCCAAAAC 


CAGATCCAAA 


AC C AG AG 


ggg 


AA.: 






aA'.:a: :: 




TAGCAACC 


AATAAACAAA 


ACCAAGCGGG 


AGCAAGA 


gag 


AGC 


AAY'" ; 




■VAGAGA 




•-AGAAAAAG 



TATGGCGGCT TT 



EF122-4 (SEQ ID NC:456) 



EKWRAKAI ELVNDGTLQI 
















PTSPDGRTPN 


AAAITKQDVR 


NAGFDLDNAY 


TIMHTMG'.GIG 




EAGtK 


GEIjG 


K," 


F.L.KTFKDQ 


ENPTLMVDAG 


DVFQ'GLPI SN 


FSKGADMAKA 


MNEVGA'G.-G.GA 






DFGG 


e: 


ALGYKDQL 


NFPILSSNTY 


YKDGSGRVFD 


PYTIVEKSGK 


KFAI VGV'ITI ' 


EG 


AGTKT 


hpk:; 


VG 


KVTFKDPI 


PEVEAVIKEI 


KEKYADXQAF 


WTGHLGVDE 


TTPKIWRGCT 






QTYP 


I:G 


IGTTVIDGH 


SKTAVESGKR 


YGKVIYAC'TG 


NYLNNVGIVT 


A?ESE?T?G-G" 






AELL 




G'EriPAVKA 


IVDEARTNFN 


AENEKVIVDY 


IPFTLDGC'RE 


tTvT-.TRGI .' G . ' 




IGGA 


IMSY 




GiAFSQPAD 


FAVTMGGG I R 


AD I KQGPIKV 


GDVI AVLPFG 


NSIAQi; T. 




".KEK 


FEMS 


1 T I. 


G I PQKDEN 


GTILLDDAGQ 


PKLGANGGFL 


HVSSSIRIHY 


DSTKPGTRLA 


C r - 


gg:-:e 


TG^T 




GSRVLGIE 


IKNRQTQKFE 


PLDEKKQYRM 


ATNDFLAjAGG 


DGYDMLGGEr. 




GIGL 


DSVL 




YGKSATSL 


RLYRAATTID 


LAQ'YKEPFPG 


ERIVSISEEA 


YKELIGGGGT 


P: 




PDP?* 


FG 


GETPVATN 


KQNQAGARQS 


NPSVTEKKKY 


GGF 














EF123-1 (SEQ ID NO:457) 














TAAAATAAAA 


AATTGGTACG 


AAGTGAACGT 




GG 




TTAG 




GAGGAAGG 


ATGAAAGAAA 


TGAGAAAGAA 


TGGTCGAATG 


GTAGiACGGTT 






rtCGG 




"TGATGTGT 
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TABLE 1. Nucleotide and Amino Acid Seqeucnccs of L. fac -.ilis Genes. 

TTGTTACTTG TTCTAAATTA TGGCACA( ZA CTCATGGCTT TGGCGGAAGA G<- TTAACAGC 

GATGGCCAGT TAACGTTAGG AGAAGTGAAG CAAACCAGCC AC-C AAGAAA7 G. : - "CTTAGCG 

CTTCAAGGAA AAGCACAACC AGTAACACAA GAGGTTGTAG TG OA I TAT AG 'IVCCAATGTG 

TCAATCAAAG CTGCACATTG GGCAGCGC ZC AATAATACGG GCAAGATTCA AG TGGATGAC 

CAGAAGAAAC AGATTCAAAT TGAATTGAAT CAGCAAGCGT TAGCAGATAC GTTAGTCTTA 

acgttgaacc ctacagctac agaagatgtg acgttttctt atg:;acaaca GJAACGAGCG 

TTGACGTTAA AGACTGGTAC TGATCCGACA GAATCAACG 3 CAAT:ACGAG TTCGCCAGCC 
GCATCAGCGA ATGAAGGTTC AACAGAAGAA GCATCTACAA ACTC:TCTGT TGCTCGTTCG 
TCCGAAGAAA CTGTCGCCAG CACGACAAAA GCGATAGAAA GTAAAACAAC T::AATCGACG 
ACTGTCAAAC CGCGCGTAGC AGGACCAACA GATATCAGTG ATI ATTTTAC AGGTGATGAA 
ACAACGATTA TCGATAATTT TGAAGATGCG ATTTATTTAA ATGCTGATGG AAC ACCAGCA 
ACACCGCCGT ATAAAGAAGA TGTGACCATT CATTGGAACT TTAACTGGTC GATTCCAGAA 
GATGTGCGAG AACAAATGAA AGCAGGCGAT TACTTCGAGT TT:AATTACC T^GCAATTTG 
AAACCTAATA AACCAGGTTC AGGTGAT7TA GTTGATGCAG A A 3G _ AATGT C7ATGGAACC 
TACACAATTA GTGAAGATGG TACGGTT 3T TTTACCTTTA ATGAG'JGAAT CACGTCTGAA 
AGTGACATTC ACGGGGACTT TTCTTTAGAT ACTCATTTGA ATiJATTCAGA TGGGCGGGGC 
CCAGGAGATT GGGTGATTGA TATTCCTATA CAAGAAGATT TC:C3:CTGT AGTGATTCCA 
ATTGTCCCAG ATACCGAACA ACAAATTGAT AAACAAGGC "J ATTTT JATCG AACGCCCAAT 
CCTAGTGCGA TTACTTGGAC GGTAGATATC AATCAAGCGA TGAAA3ATCA AACAAATCCA 
ACTGTGACGG AAACATGGCC AACAGGGAAT ACCTTTAAGT CC:;T3AAAGT C7ATGAGTTA 
GTGATGAATC TTGATGGAAC AATTAAAGAA GTGGGTCGCG AA;:TTAGTCC AGATGAATAT 
ACCGTTGATA AAAATGGCAA TGTGACGATT AAAGGTGACA CGAACAAAGC GTATCGTCTT 
GAGTACCAAA CGACGATTGA CGAGGCGGTT ATTCCAGATG GCGGCGGCGA TGTGCCTTTT 
AAAAATCACG CGACGTTAAC AAGTGATAAT AATCCAAATG G3TTAGATGC TGAAGCAACT 
GTTACCGCCA CATATGGCAA AATGTTA ZAC AAGCGCAATA TAGATTACGA CGAAGCCAAT 
CAAGAATTCA CTTGGGAAAT TAACTACAAC TATGGTGAA:' AAAGCATTCC A AAAGACCAA 
GCAGTCATTA CAGACACAAT GGGGGATAAT TTAACGTTTG AAC TAGATTC T'.'TACATTTA 
TATTCAGTGA CATTTGATGA C AAAG G A A AT GAAGTCGTTG GAGCAGAACT T G TGG AAGGA 
AAAGATTACA AAGTGGTAAT CAACGGAGAC GGTTCCTTT?- CAATT3ACTT T^TACATGAT 
GTGACTGGCG CAGTCAAGAT TGATTATAAA ACCAAAGTTG ATC-GAATTGT CGAAGGCGAT 
GTTGCCGTGA ATAATCGTGT GGATGTTGGC ACTGGTCAGG ATTIAGAAGA TGATGGCACA 
GCCAGTCAAC AAAATATTAT TAAAAACACT GGTGCAGTTG ATTATCAAAA TTCAACGATT 
GGTTGGACGT TAGCTGTGAA TCAAAATAAT TATTTGATGG AAAATGCCGT GATTACGGAT 
ACGTACGAAC CAGTTCCTGG CTTAACTATG GTACCCAAT7 CGTTCGTTGT CAAAGATACA 
ACCACTGGTG CTCAGTTGAC GTTAGGCA^AG GATTTCATGG TAGAAATAAC TGGTAATGCA 
GATGGTGAAA CAGGCTTTAA GGTAAGTTTT ATAGGGGCGT ATGCCAAAAC AAGTGATGCC 
TTCCACATAA CTTATACTAC CTTTTTCGAT GTTACCGAGT TAGACGCTAA C/,ATCCTGCG 
TTGGACCATT ATCGAAATAC CGCTGCCATT GATTGGACGG ATGAAGCAGG AAACAATCAT 
CATTCAGAAG ATAGTAAACC GTTTAAACCT TTACCTGCTT TTGATTTAAA Tl'CCCAAAAA 
AGCGGTGTTT ACAATGCCGT CACCAAAGAA ATCACTTGGA CGA r .!'TGCGGT TA.ATTTAAGT 
AATAATCGTT TAGTCGACGC CTTTTTCACG GATCCAATTT TAACCAATCA AAGCTATTTG 
GCTGGGAGCT TGAAAGTCTA TGAAGGCA.AT ACAAAGCCAG ATGOTTCGGT TGr'iAAAAGTG 
AAACCAACGC AACCGTTGAC GGATATG ACA ATGGAAGAA " CAAGCGAGAA AAACCAAAAT 
ACTTGGCGTG TTGATTTTCC TAATGATAGT CGTACGTATG TG A TTGAATT TAAGACGTCT 
GTTGATGAAA AAGTTATCGA AGGTTCGGCT AGTTATGACA ATACCGCATC T T A T AC AAAC 
CAAGGTTCTT CACGTGATGT GACAGGAAAA GTTTCTATTC AACATGGTGG C GAATCAGTG 
AAAAAAGGTG GCGAATACCA CAAAGATGAT CCAGATCATG TGTACTGGCA TGTAATGATC 
AATGGCGCCC AATCGGTTTT AGACGATGTG GTTATTACTG ATACACCCTC A.'CAAACCAA 
GTGCTAGATC CCGAGTCATT GGTGATTTAC GGTACCAACG TAACAGAAGA CGGAACTATT 
ACGCCAGATA AATCTGTTAT TTTAGAAGAA GGAAAAGATT ACA-ACTGGA AGTTACCACC 
GATAATGAAA CAGGACAACA AAAAATTGTC GTTAAAATGG CCCATATTGA AGCACCTTAT 
TATATGGAAT ATCGTAGTTT AGTGACTTCT TCAGCGGCGG GGA3TACAGA CACGGTATCC 
AACCAAGTGT CAATTACTGG AAATGGTTCA GAAGTCGTTG AT "A^GGATGA CAATGGCGAT 
GTGGTCGTTG ACATTGATCA CAGTGGCGGG CATGCCACAG GGA TTAAAGG CAAAATTCAG 



WO 98/50554 



'(_ T I S98/08959 



TABLE ]. Nucleotide and Amino Ac:d Seqeucnccs of£. a. l \ Gene> 



- - G AA G AAAA. 




















ATTTGG 


GACCAAG GGGA 


j-u -u-i. 1 ^ C A^\ ^ T 


— ^ ^ ^ 












- 




AT C AC A 


CTTGGTGGGT 


^ 1 j ^ /~u^i 


A G A A TAG A. TT 
















G GG TAT 


AGA.GTT T GGG 


ACG/iATTAG I 






- AC TA 










- 


G'GAGCC 


3AA.GGAGCAC 


AA C 1 G AA C C A T 


* - - 




gaata 












A-„AAATG 


3ATGA3AAG 3 


^.j^K^A^ .T 


AGT - AA _ ^ _ T 




I A A AT 






. .^TG._ 


g: 




A G GACG 


GCGTTTACTG 


nTT GG j^AC-.-i 


aggttcg gggt 




^3 at: 






AAIGG 


:; 




GGCCAG 


TTAGAGGTGG 


ATA ^ T _ TAj"*--v 






GTTGA 


..'AG 




" 


A 


GA 


G GGACA 


GGCTATCTTT 


TAGACACGAC 


■ - AAi-. _ ; ^ 


TT " A 


GGT A A 


- - - 




AAT A G 


g; 




G GACAA 


ATTCGTGATG 


TTCATGTCAA 


AA T j G T G AA ' r 


taggaaggtt 


:tg 




GAAGG' 


A. 


G^G 


AAAAAA 


GACCAAGCAG 


GCAATGCATT 




GAATT 


gtcag 


ggg 


t: 


gaga: 


g; 




A GACAA 


GCAGTTCGAG 


AACACTTAGT 


CT-GGAT 1 ^ _A 




AAAAG 


G c.-. 


c;- 




G, 


AT 


GTAGCC 


CCAGGAAAAT 


ATCAATTTGT 


■ j 1 A AAA IT ■ G .^^A. 


g:ggc 


AGCAG 


GGT 


a: 


CTTTT 




AC 


AGTGAA 


CCAAGTGCTT 


TCAEGATTGG 


AGC AA GGG AAA 




G AAAC 


GAG 


g ; 


A GA.GG 1 


g; 


GA 


^GAACG 


GCTAACTTTG 


TTAACTATCA 


AGGCA GGGCT 




aat:a 


AAA 


AA 


GATGG 


g; 


AT 


GGACAC 


TTATTAAGTG 


GTGGGACATT 


TAAAGTGCTT 




AAAGG 


GAG 


a: 


.\ GGAG' 






A GAGGC 


TTGACGAGAA 


ataatgaagg 


GG.-iAA i *GTT 






ta^: : 




G GAG Z 


A.-" 




GATCGC 


TTTGTAGAAA 


CCAAAGCGCC 


AACAGG GTAC 


TTATT 


aaata 


C AG 


::: 


gagag: 


- 




GTTGAA 


ATTGCTGAGA 


AAAATGCTGG 


TAAACGAGGG 


GTCGT 


AGTTG 


CTA 


G r : 


GAC.AA. 


CG 




.GTGAGT 


TACAAAGGGG 


CTTTCGAAAT 


ggtgaaaagg 


AAT AG 


- 1 


r\ - 


AG 


CGATG 


A 




GGTGCT 


GTTTTTGAAT 


TATATGATCA 


CAATAAAGAA 


TGATT 


AGGGA 


TTA 


A~ 


g—aa: 


GG- 




G GCAAA 


GATGGCAAAA 


TTATCTTTAG 


AGACTTGGCG 


G _ j 


IACCT 




AG 


TAG AA 


A-', 




ATCAAA 


GCACCAAAAT 


TAG GAGATGG 


CGCAGATTAT 


ATCAT 


TTATG 




A* 


TTAGT 


A- ■ 




GTAGAA 


ATTCGTGGTG 


ATTTGAAAG G 


TGATCCGGAG 


ATTTT 


: :aat 












A.ATTTC 


AAAGGACGCG 


CCGTCTTTAA 


GAAAATTGAT 


GCCAA 


TGGGA 






CTTCC 




GGx.A GGATT 


TTTAAATTGT 


ATGGAATCGA 


AA>v ( „ j'^'^^iA 


AAAAT 


:tttg 




GG 


GAAGT 


AG 




AGTGAA 


AAAGATGGTT 


CATTGGGTAT 


GGAGGATTTA 


GGTG G 


TGGTA 


GGT 


AG 


G.'iATT 


AG 




GAACTG 


GATGGAACGG 


ATGGCTATAT 


GGTCAATAAA 


ca.v:: 


gattt 


A.T'i 




GTAGT 


GA 




AAGAAT 


TCAAATGATA 


aacaacca:t 


AGATGAGTTA 




CGTAA 


ATT 


AT 


IAAvG: 


A 




GTAATG 


GGACGTAAAG 


TCAACGAGGA 


AGGTCAAACC 


ttagg 


A AGTG 












G AC AAT 


GCCGATGAGC 


AG AATC AGC C 


GGAAGGTTGA 




AACAT 


TCT 


TG 


AATG G 


T: 




G GAGAA 


AAAGTTTCTG 


AAATAACAAC 


GGATAAGAC G 


G'jj _ jA 


.AATTT 


_ 1 „■ 


GG 


AJVAGG 


GG 




AATGAA 


GGGCATTACG 


TTTTAGTG3A 


AACGAAAGGA 


GGAA : 


AGGCT 


A. T 1 J 


TG 


TTA 


CA 


GGAACGCTA 


CATCCATTTG 


ATGTAACCGC 


CGAATTAGGA 


AAA.GA 


GCAGC 


CAA 


GG 


GCTTT 


AG- 




GATCTT 


ATCAATTATC 


AAGGAACTGC 


TCAATTAACC 


AAAGAAAAGG 




GA 


GGTGA 






GGTGGGGTGT 


TTAAGGTCAT 


TGATGAAAGA 


GGG GAAAVCCG 


TA'"; 


AG 


GGACA 


A r - 




.AATCTG 


ATGTCTGACA 


AGCAAGGCAA 


AGTCATTGCG 


AAAAA 


GTTAG 


C A. r 


CG 


GGAAC 


■GG 




GGTTTT 


GTGGAGACAC 


AAGCGCCAAG 


TAGCTATCTT 


CTTAATGAAA 


GGG 


CA 


AGCGC 


AG- 




GTTACG 


ATTGCCAAAG 


ACAACCAAGG 


CAAACCAGGC 


ACT AT 


2 IT AC 






GCACC 






ATT AAT 


TACCAAGGTG 


CTGCCAAGCT 


GGTGAAAATT 


GATGA 


GCAAA 


AGA 


A'l 


! " ^CTT 






GGTGCT 


GAATTTAAAG 


TGACAGATGG 


AGAGACAGGG 


CAAAC 


TGTCG 


c T 




TGATT 


A- 




GCTGAC 


AACCAAGGGT 


TAGTTCAAGT 


GAACCACTTA 


caagc 


AGGAA 


AAT 










jAAACA 


AAAGCACCGG 


ATGGTTACCA 


ACCGTCTAAG 




TGTCG 






A.GTAT 






GCAACA 


GCGAAAGACA 


AACCTGAAGT 


CGTGAATGCG 


GGCAC 


GTTTG 


TTA 




GGAGAA 


AG 




GCTGTA 


TCCAAAAAAA 


CAAAACGAAA 


TGAGCCAAGA 


ACGAA 


AGAAG 






A GAGA. 






GGTTGG 


CTTGGTTTAC 


CGAAAACCAA 


CACACAAGTC 


AATTA 








TTTAT 






GTCATG 


TTGGTCGGTT 


TGGCAAGTTG 


GCTCTTCTAT 


AAA- LAG A G C A 


AGA 




AAA 









EF123-2 (SEQ ID NO:458) 



MRKNGPMV NRWLYGLMCL LLVLNYGTPL I-1AL A E E S D 

GQLTLGEVKQ TSQQEMTLAL QGKAQPVTQE VV^HYSANVS :?GAAHG:AAP:g iGGG : GG T^VDDQ 
KKQIQIELNQ QALADTLVLT LNP TATE DVT FSYGQQ2RAL TLG'TGTDPTE EG GGTSSPAA 
SANEGSTEEA ETNSSVPRSS EEGTGASTTKA IEEKTTESTT GGAG GGG AG PTC IG GFTGDET 
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TABLE 1. Nucleotide and Amino Acid Seqeucncc.s of £. faecal is Genes. 

TIIDNFEDPI YLNPDGTPAT PPYKEPVTIH WNFNY;S"IPED VRECJIKAGDY F) ■FQLPGNLK 
PNKPGSGDLV DAEGNVYGTY TISEDGTVRF TFNEF ] TSES DIHGPFSLDT HLGOPSDGRGP 
GDWVIDIPTQ EDLPPWIPI VPDTEQQIDK £ GHFPF TPNP SAl Tl.'TVDIN 0AI4FPQTNPT 
VTETWPTGNT FKSVKVYELV MNLDGTIKEV GPKLS I DEYT VPKUCnVTIK G P TATA Y RLE 
YQTTIDEAVI PDGGGDVPFK NHATLTSDNN PI1GLPAEATV TATYOF-MLDF FPILYDEANQ 
EFTWEINYNY GEQTIPKDQA VITDTMGDNL TFEPDFLHLY £ VTFDEKGNE YY GAEL VEGK 
DYKWINGDG SFAIDFLHDV TGAVKIDYKT KVPG1YEGDV AVNIIFVDVGT G-r H SEDDGTA 
SQQNI IKNTG AVDYCjNSTIG WTLAVNQNNY LKEUAYITDT YEPYPGLTMY PI i.7 L-WKDTT 
TGAQLTLGKD FMVEITRNAD GETGFKVSFI GAYAE7SDAF H I TYTTFFDY TGAGANTIPAL 
DHYRNTAAID WTDEAGNNHH SEDSKPFFPL PAFDLPAQKS GYYMAVTKEI TG7I AV7JLSN 
NRLVDAFLTD PILTNQTYLA GSLKVYEGHT EPPGSYEKVK PTAPFTDITM E7P "EKNQNT 
WRVDFPNDSR TYVIEFKTSV DEKVIEGSAS YPNTASYTNQ GS3PPVTGFV £G TgGGESVK 
KGGEYHKDDP DHVYWHVMIN GAQSVLDDW ITPTPSPNQV LDPEfLVIYC T! A ; 7EDGTIT 
PDKSVILEEG KDYTLEVTTD NETGQQKIW KMAH IEAPYY MEYF.PLVTFF A/,:.:TDWSN 
QVSITGNGSE WHGDDNGDV WD1DHSGGH A7GTKOEIQL KKTAKPETTZ IA.GAKFQIWD 
QAKTQVLREG TVDATGVITF GGLPQGQYIL VFTKAPEGYT VSPEI AKGRY 17LPEETSAE 
GAQPTI IKND VNKVFLEKMD EKGKKLVNAR FFLEHAVTTP FTHV.'EEVPLA PI" RTI JA1IGQL 
EVDSLKPGLY QFTEIEAPTG YLLDTTPKRF IVTC'DTSGQI FDVJiVTI^Lrn' C--AELIKKD 
QAGNPLAGAE FSVLDTTGQA VREHLVSDAN Gr VTVTDLAP GKY i ~'F v ETP7A PAGVPLUTEP 
SAFTIAASDR GKPATVIATA NFVTJYQGTAK L1FPPVNGHL I GG A.TFKVPP AFGblTIQTGL 
TTNNQGEIVA EHLAPGKYRF VETKAPTGYL LPTTP^PFEI AEPPACKPA^ VYA" ONFVSY 
KGAFQIVKTN SADQPLAGAV FELYDHNKQS LCG TA/PSGKD GKIIFKDLAP G7YYYKEIKA 
PKLPDGADYI IYPELVKVEI RGDFKGDPEI FOPGAFANFK GRA"-* FFKI DA NANPLPGTIF 
KLYRIENGEK IFEREVTAEK DGSLAMEDLG AG SY ELDELD ATDCY 1 VNKQ PIYFWKKNS 
NDKQPLDELE FVNYQAEVMG RKVNEQGQTL AGA^FAIYNA DEO.'JOPQGSl 1 Z GAP! JRAGEK 
VSEITTDKTG EIYAKGLNEG HYVLVETKAP TGYLLDTTLH PFDYTAQLGF. EAFGALGDLI 
NYQGTAQLTK ENETGEALAG AVFKVIDETG QT^HGQTNLM ^DKGOKVIAK PI .APGTYRFV 
ETQAPTSYLL NETPSASFTI AKDNQGKPAT WIY-'APFINY QGA<\KLVKID QGKUALAGAE 
FKVTDAETGQ TVARSLRSDN QGLVQVNHLQ PGKY7FVETK APDYG.'LSKC: AVAFTIAATA 
KDKPELVNAG TFVNEKQPVS KKTKPNQPTT KQAARETGWL GLPKT! JTQVI I YFF'TIGLML 
VGLASWLFYK KSKK 

EF123-3 (SEQ ID NO:459) 

GGAAGA GGTTAACAGC 

GATGGCCAGT TAACGTTAGG AGAAGTGAAG CAAACGAGCC AGGAAGAAAT GA 7CTTAGCG 

CTTCAAGGAA AAGCACAACC AGTAACACAA GAG 'JTTGTAG TG7ATTATAG TGCCAATGTG 

TCAATCAAAG GTGCACATTG GGCAGCGCCC AATAATACGC G AAAGATTCA AGTGGATGAC 

cagaagaaac agattcaaat tgaattgtjvt cag:aagcgt tag:a:;atac gttagtctta 
aggttgaacc ctacagctac agaagatgtg acggtttctt atgga'jaaca ga aaggagcg 
ttgacgttaa agactggtac tgatccgaca ga-ytcaacgg caatgacgag r: : ; ^:cAGCC 

GCATCAGCGA ATGAAGGTTC AACAGAAGAA GCA r A7ACAA A Z'Z'Z ATCTGT T'GFGGTTCG 
TCCGAAGAAA CTGTCGCCAG CACGACAAAA GC GAT AG AAA GTAAAACAAA T ; . A AFCGACG 
ACTGTCAAAC CGCGCGTAGC AGGACCAACA GATATGAGTG AT7A77TTAC A ' 7GATGAA 
ACAACGATTA TCGATAATTT TGAAGATCCG ATTTAGTTAA ATG :7 3ATGG A'AW-CAGCA 

acaccgccgt ataaagaaga tgtgaccatt cattggaact ttaa.:tggta ggttccagaa 

GATGTGCGAG AACAAATGAA AGCAGGCGAT TAGTTCGAGT TTCAATTACC TGG "AATTTG 
AAACCTAATA AACCAGGTTC AGGTGATTTA GTTGATGCAG aagg:aatgt ctapggaacc 
TACACAATTA GTGAAGATGG TACGGTTCGT TTTACCTTTA ATGAG7GAAT CAC ^TCTGAA 
AGTGACATTC ACGGGGACTT TTCTTTAGAT ACTGATTTGA ATGATTCAGA TGG 3GGGGGC 
CCAGGAGATT GGGTGATTGA TATTCCTACA CAA 3AAGATT TGCG 3CCTGT AAT5ATTCCA 
ATTGTCCCAG ATACCGAACA ACAAATTGAT AAACAAGGCC ATTT TGATCG AAC^TCCAAT 
CCTAGTGCGA TTACTTGGAC GGTAGATATC AATCAAGCGA 1 GAAAGATCA AAG 'iAATCCA 
ACTGTGACGG AAACATGGCC AACAGGGAAT ACCTTTAAGT CC ETGAAAGT C7"GGAGTTA 
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GTGAAGAA.7C 








... . , . 


AA. _ 


_ - 






- — ■ - - rp f. rp 


ACCG77GATA 


.-^."^ . ■ j o - 


7G7GAGGATT 




■ --r . 2 A 


: ^ C 








. . . _-TCTT 


GAGTACCAAA. 


* " ~" ~ "~ * ' 




_ - 


- ■ ■- „■ .-. - 






^ ^ ^ 






/^rvAAAT ^ ACG 




A A G ^ G A .AA T 




- ' - ^ J 


<GG „ 




-tAI'G g 


T . 


.AAG- GAACT 


G TT A G C G C C A 


C A ' PA T G G G AA 


AG 1 . TG7TAGAC 






TA G 




TACGA 




G A G GC AAT 


G AAG AATTC A 




T AA. G T A C AA G 


TA ; ■ 


- . - . 


AAA 




A.TT ? G 




AAGACCAA 


GG AGTCATTA 




GGGG GA7AA.7 


TGAG 


■ -.: - - - <0> 


AA G 




GAT7C 




GAGATTTA 




C A. T T 7 G A. T G A 


/ AAA 3 ■ G AA_A T 






GAG 




GAA TT 


T . 


7 GGAAGGA 




AA j ■ j j i AA. ^ 




Gv:- - 




CAA 


* - 


GACTT 




'GAG ATG AT 


GTGACTGGCG 


GA jT - AAGAT 


TGA7 7 ATAAA 






ATG 




-ATT 37 




A A GGGGAT 


GTTGCCGTGA 


ATAATCGTGT 




Ac; ; 




at; 




GAAGA 


TG 


A7GGGACA 


GGGAGTCAAG 


AAAATATTAG 


T A AAAA C AC T 


GC'G. 




AT 'I 




GAAAA 


T". 


GAACGATT 


GGTTGGACGT 


TA 5 CTG7G AA 


T G AAAAT AAT 


TAG" 


. G GATGG 


AAA 




3CC37 


G .-" 


77AGGGAT 


AGGTACGAAG 


CAGTTCCTGG 


/GTAAGTATG 


GTA. 


■g: aatt 


CGI 




GTT G7 


c. 


aggaataca 


ACC ACTGGTG 


CTCA 3 TTG AC 


GG TA3 3CAAG 


GAG' 


a: atgg 


TAG 




ATA AG 




;aggatgca 


GATGGTGAAA 


CAGGCTTTAA 


GGGTAAGTTTT 


at;/ 




ATG 




^-^AAaG 




gtgatgcc 


TTCGACATAA 


CTTATACTAG 


:CTTTTCGAT 






TAG 




GCTAA 




gg: gtgcg 


TTGGACCATT 


ATCGAAATAG 


CC-C TGGCATT 


GA'G 















EFI23-4 ( SEQ 13 NO:460) 
EEVNSD 

GQLTLGEVKQ TSQQEMTL.AL QGAAQPVTQE 
KKQIQIELNQ QALADTLVGT LNPTATEDVT 
SANEGETEEA STNSSVPR.GS EE7VASTTKA 
TIIDNFEDPI YLNPDGTPAT FPVKEDVTIK 
PNKPGSGDLV DAEGNVYG7Y TISECGTVRF 
GDWVIDIPTQ EDLPPWIPI VPDTEQ£IDK 
VTETWPTGNT FKSVKVTEGV LCGCCGTIKEV 
YQTTIDEAVI PDGGGDVPFK KHATLTSDNN 
EFTWEINYNY GEQTI PKDQA VITETl-IGDNL 
DYKWINGDG SFAIDFLHCV TGAVK ITYKT 
SQQNI IKNTG AVDYQHSTIG W7LAVT1QMNY 
TGAQLTLGKD FMVEITRNAD GETGFf^VSFI 
DHYRNTAAID W 



VvGG-G 


:• a::gs 


IKAAH 


IGAAPN 




I QVDDQ 


FSGG 


: : cg/al 


TLGAG 


TDPTE 




7= SPAA 


I EGG/ 


G'GGGTT 


VKIAG. 


AG PTC 


I G GA 


] 'TGDET 


l\GGG/ 


'GAPED 


VRi ] 


GJVGEY 




APGNLK 


tf::g 


■ 7 7f : ES 


DIK3E 


FSLHT 






QG: 


AAPUP 


S A I TV 


r rvciw 




.3QTNPT 


GRGG 


'A GGYT 


VCPG/C 


: r^T I k 




GGAYRLE 


pr/ G 


GAEATV 


TA7YG 


?g^l:k 




GGEANQ 




GGLHLY 


SVI'FG 


DKGNE 


V. G/" 


GG-jVEGK 


kv: G 


A.AGDV 


AVIGA' 


■/DVGT 


G H 


7ADGTA 


lme:-; 


A'ATDT 


YEPVP 


GLTMV 


?gg: 


G.77KDTT 


GAY A 


\7GGAF 


HI7YG 


AFFEG^ 




JANNPAL 



EF124-1 {SEQ ID NO: 461) 

TAAAATAAAA AATTGGTAGG A/V'^TGAACGT 
ATGAAAGAAA TGAGAAAGA7\ TGGTCCAATG 
TTGTTACTTG TTCTAAAT7A TGGCACACCA 
GATGGCCAGT TAACGTTAGG AGAAGTGAAG 
CTTCAAGGAA AAGCACAAGC AGTA/^CACAA 
TCAATCAAAG CTGCACATTG GGCAGCGCCC 
CAGAAGAAAC AGATTCAAAT TGAA7TGAAT 
ACGTTGAACC CTACAGCTAC AGAAGATGTG 
TTGACGTTAA AGACTGGTAC TGATCCGACA 
GCATCAGCGA ATGAAGGTTC AAGAGAAGAA 
TCCGAAGAAA CTGTCGCCAG CAGGACAAAA 
ACTGTCAAAC CGCGCGTAGC AGGACCAACA 
ACAACGATTA TCGATAATTT T3AAGATCCG 
ACACCGCCGT ATAAAGAAGA TGTGACCATT 
GATGTGCGAG AACAAATGAA AGCAGGCGAT 



TC" 




"7 AT 


GTGTGGTTAG 


T/- 


GA G AAAGG 


GTA 




GGTT 


GGC7C7ACGG 


GG 


GGATGTGT 


CTG 




GGTT 


TG'/GG GAAGA 


G:" 


77A.ACAGC 


CAG 




AGGC 


AGAAGGAAA/G 


CA 


GCTTAGCG 


GAG 




G7AG 


TGCAT7ATAG 




/GAATGTG 


AA/ 




AGGC 


GCA.AGATTCA 


AG 


7G7ATGAC 


CAG 




GGGT 


TAGCAGATAC 




PAG TC TTA 


ACC 




^'~TT 


ATGGAGAAGA 




AG : GAGCG 


GAG 


TGG 


AG GG 


CA.ATCA.CGAG 




GG 2 GAGCC 


GCG 




G GAA 


AC'I'CCTCTGT 


t: 


GTGGTTCG 


GCG 


AT/ 


GAAA 


GTAGV.A.ACAAC 


T/ 


AATGGACG 


GAG 


at; 


AGTG 


ATTATTTTAC 


AC 


GTGATGAA 


ATG 


/ ^ - 


TTAA 


ATCCTGATGG 


a; 


GA/GAGCA 


CA/ 




A7iCT 


TTA A C TGG TC 


g; 


7TGCAGAA 


TAG 


TTG 


AAAGT 


TTC AATT ACC 


t: 


GG/^ATTTG 
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TABLE 1. Nucleotide and Amino Acid Scqcucncjs of E. faecal is Genes. 

AAACCTAATA AACCAGGTTC AGGTGATTTA GTTGAT..*CAG AAGG 7 AATGT CTATGGAACC 
TACACAATTA GTGAAGATGG TACGGTTCGT TTTACCTTTA ATGAGCGAAT CACGTCTGAA 
AGTGACATTC ACGGGGACTT TT77TTAGAT ACTCAT7TGA ATGATTCAGA T:- GGCGGGGC 
CCAGGAGATT GGGTGATTGA TATTCCTACA CAAGAA -:ATT TGC C7-CCTGT A CTG ATTCCA 
ATTGTCCCAG ATACCGAACA ACAAATTGAT AAACAA -GCC ATTTTGATCG AA:: - 7CCAAT 
CCTAGTGCGA TTACTTGGAC GGTAGATATC AATCAA'-CGA TGAAAGATCA, AACAAATCCA 
ACTGTGACGG AAACATGGCC AACAGGGAAT ACCTTTAAGT CCGTGAAAGT CTATGAGTTA 
GTGATGAATC TTGATGGAAC AA T T AAAG AA GTGGGT.'GCG AACTTAGTCC AAATGAATAT 
ACCGTTGATA AAAATGGCAA TGTGACGATT AAAGCTGACA CCAACAAAGC G7ATCGTCTT 
GAGTACCAAA CGACGATTGA CGAGGCGGTT ATTCCA^ATG GCGGCGGCGA r J'< :TGCCTTTT 
AAAAATCACG CGACGTTAAC AAGTGATAAT AATCCAAATG GG7TAGATGC TGAAGCAACT 
GTTACCGCCA CATATGGCAA AATGTTAGAC AAGCGCAATA TAGATTACGA CGAAGCCAAT 
CAAGAATTCA CTTGGGAAAT TAACTACAAC TATGG7GAAC AAAG CATTC " A A AAGACCAA 
GCAGTCATTA CAGACACAAT GGGGGATAAT TTAACG7TTG AACC/.GATT^ 777ACATTTA 
TATTCAGTGA CATTTGATGA C AAAGG AAAT GAAGTCGTTG GA 7C A GAACT 7GTGGAAGGA 
AAAGATTACA AAGTGGTAAT CAACGGAGAC GGTTCC7TTG CAATTGACTT T7TACATGAT 
GTGACTGGCG CAGTCAAGAT TGATTATAAA ACCAAA :.;TTG ATGGAATTGT CGAAGGCGAT 
GTTGCCGTGA ATAATCGTGT GGATGTTGGC ACTGG77AGC AT7CAGAAGA TGATCGCACA 
GCCAGTCAAC AAAATATTAT TAAAAACACT GGTGCA 'JTTG AT7A f 7CAAAA T7CAACGATT 
GGTTGGACGT TAGCTGTGAA TTAAAATAAT TATTT-^ATGG AAAATGCCGT GATTACGGAT 
ACGTACGAAC CAGTTCCTGG CTTAACTATG GTACC /AATT CG'JTGGTTGT C.AAAGATACA 
ACCACTGGTG CTCAGTTGAC GTTAGGCAAG GATTT:ATGG TAC-AAATAAC TCGTAATGCA 
GATGGTGAAA CAGGCTTTAA GGTAAGTTTT ATAGG :.7CGT ATGCC AAAAC AA3TGATGCC 
TTCCACATAA CTTATACTAC CTTTTTCGAT GTTAC.'AAGT TAG AC GCTAA CAATCCTGCG 
TTGGACCATT ATCGAAATAC CGCTGCCATT GATTG ."ACGG ATGAA 3CAGG A.--AC AATCAT 
CATTCAGAAG ATAGTAAACC G7TTAAACCT TTACC7GCTT TTGATTTAAA 7 A07CAAAAA 
AGCGGTGTTT ACAATGCCGT CACCAAAGAA ATCAC7TGGA CGATT ^CGGT 7, TAT "I TAAGT 
AATAATCGTT TAGTCGACGC C7TTTTGACG GATCCAATTT TAA.CC AATCA AACC7ATTTG 
GCTGGGAGCT TGAAAGTCTA TGAAGGCAAT ACAAA ACCAG ATGG7TCGGT TGAAAAAGTG 
AAACCAACGC AACCGTTGAC GGATATCACA ATGGAAGAAC CAAGC3AGAA A AACCAAAAT 
ACTTGGCGTG TTGATTTTCC TAATGATAGT CGTAC : 7 ATG TGATl 3AATT T/AGACGTCT 
GTTGATGAAA AAGTTATCGA AGGTTCGGCT AGTTA7GACA ATACC 3CATC TTATACAAAC 
CAAGGTTCTT CACGTGATGT GACAGGAAAA GTTTC7ATTC AACA'I ^GTGG CGAATCAGTG 
AAAAAAGGTG GCGAATACCA CAAAGATGAT CCAGA'7::ATG TGTACTGGCA TGTAATGATC 
AATGGCGCCC AATCGGTTTT AGACGATGTG GTTAT7ACTG ATACACCCTC A -CAAACCAA 
GTGCTAGATC CCGAGTCATT GGTGATTTAC GGTAC 7AACG TAACAGAAGA Cl-GAACTATT 
ACGCCAGATA AATCTGTTAT T7TAGAAGAA GG AAA. AG ATT AC ACACTGGA A.7TTACCACC 
G AT AA TG AAA CAGGACAACA AAAAATTGTC GTTAAAATGG CCCATATTGA A^CACCTTAT 
TATATGGAAT ATCGTAGTTT AGTGACTTCT TCAGC : GCGG GGAG7ACAGA C-CGGTATCC 
AACCAAGTGT CAATTACTGG AAATGGTTCA GAAGT 7GTTC ATGGGGATGA CAATGGCGAT 
GTGGTCGTTG ACATTGATCA CAGTGGCGGG CATGC "AC AG GGAC7AAAGG CAAAATTCAG 
CTGAAGAAAA CAGCCATGGA TGAGACGACT ATTT7AGCAG GC ^CCCATTT C AAATTTGG 
GACCAAGCTA AAACACAAGT C 7TACGTGAA GGTAC , :-TAG AT "ICGACCGG C7T7ATCACA 
TTTGGTGGGT TGCCACAAGG GCAATACATT TTGGT 7GAGA CAAAAGCACC A -AAGGCTAT 
ACAGTTTCGG ACGAATTAGC TAAAGGCCGA GTCATTACTA TT^ATGAAGA A AC7TCAGCC 
GAAGGAGCAC AACCAACCAT TATTAAAAAC GATG7CAATA AAGTATTTT7 I-. "'AAAAAATG 
GATGAGAAGG GTAAAAAGTT AGTCAATGCT CGCTT^AAAT TAGAGCATGC CGTAACCACG 
CCGTTTACTC ATTGGGAAGA AGTTCCCCTT GCGCC GGATC GAACCAACGG CA„A7GGCCAG 
TTAGAGGTGG ATAGTTTAAA ACCAGGGCTT TATCA "-TTCA CAGAAATCGA A GCACCGACA 
GGCTATCTTT TAGACACGAC CCCCAAACGA TTCA^ I'GTGA CACAAAATA: GA5CGGACAA 
ATTCGTGATG TTCATGTCAA AATGCTTAAT TACCAAGGTT CTGC7GAACT AATTAAAAAA 
GACCAAGCAG GCAATCCATT AGCAGGTGCT GAAT7 TTCAG TC7T7GACA7 C-.CAGGACAA 
GCAGTTCGAG AACACTTAGT TTCGGATGCA AACGG.AAAAG TCACAGTGA: GGATTTAGCC 
CCAGGAAAAT ATCAATTTGT GGAAACCAAA GCGCC AGC AG GG7ACCTTTT AAACACTGAA 
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C3AAGTG3TT 




agcaagcgat 


f" ' : ; ^ " 


' ■*■ -"■ - C 


CA 


337A3A'A ' 


.7 A 


' AAA CG 


G3T3ACT77G 


tt^-u^ - ^ ^ 




AAAT7 


7A3 3A 


AA 


A3. 3AT37' : 


337 


3 3ACAC 


TTATTAA3TG 


GTGCGACA7T 


t.aaagtggtt 


G A7G 


a a: ag 


GA 


AAA3GA : 




. /iGvjC 


TTGACGA3AA 


>l T T .-vrv 3 G 


GGAAATTGTT 




AAGT 


TA 


■2. 33A.-A ; 


AA 


77ATGGC 


7*7 7GTAGAAA. 


^- _ vj: 


aagagg3tat 


77 A 7 7 


"■.A ATA 


G3 


'•-3 . j :;, : : ; 


" 3 7. 


777 3AA 


ATTu _ TGA.G.**. 


AAAA.TG3 . jG 


C n. H ^ M .'.- _/vG^.Cj 


-,, ■ - 


- : 3TG 


37 


_ 3 




"TGAGT 


TAG AAA 33GG 


gtttggaaat 




AA7A7 


J 3 3 AG 


A3 


3 AA 3 3 ATT 3 


: ^ ,\ 


GGTGCT 


GTTTTTGAAT 


TA . AT J/-.T - A 


7, AA 7 AAA C AA 


3CAT7 


A-GGA 


T7 




737 


33 2 AAA 


3ATGGC AAAh 


TT AT r? 4 1 AG 


G j-. _ . 7 3 Cj C G 




7 AG 3T 


AT 


7ATTACAA ; 


AA 


AT 7 AAA 


G 3ACGAAAAT 


TA 3 C AG AT 3G 


:g :a tattat 


A77A7 


777,73 


CT 


3AATTACT 7 


AAA 


3TAGAA 


ATT3GTGGTG 


ATTTCAAA3G 


7 . ^ A 7 7 I 3 GAG 


ATTT7 


""" AAT 


TA 


3G3 3CGTT ; 


: 3 3 


AATTTC 


AAAGGACGCG 


CCGTCTTTAA 


j;-iA w ^' v77G AT 


G 3 3 A3 . 


733GA 


a: 


3ga:tt3 3 ; 


. AA 


A 3 3 ATT 


TTTAAATTGT 


ATGGAATC 3A 


A.AA 3 5 3 3G AA 




'"^TTG 


AA. 


V37AAAGT 3 


".37 


3CTGAA 


AAAGATG GTT 


CATTG3CTAT 


G 3 AG 5 AT7TA 


GGT37" 


7G3TA 


G3 


7A7 3AATT 3 


7A7 


3AACTG 


G A TG C AA 3 GG 


atgggtatat 






'.' ATTT 


a: 


7T: 3TAGT • 


: ;. : 


A. A 3 AAT 


TCAAATGATA 


AA3AA33A3T 


A 3 ATG A 3 TT A 


GA 37 7 


: G3AAA 


at 


7A ; ; ;aa"-:: ; 


AAA 


GTAATG 


GGACGTAAAG 


TCAACGAG "A 


AG3T7AAACC 


i ^ A ~ ■■ - 


" 33TG 


GA 


3T7TTT3 3 7 


.ATT 


TA 3 AAT 


GC3GATGAGC 


AGAATCAGGC 


CCAAG3TTCA 




AA3AT 


t: 


TTG.AATCG ' 


: 3 A. 


GGAGAA 


AAAGTTTCTG 


AAATAACAAC 


G3ATAAGACT 






a: 


3 37AAAGG ■: 


- 7'T.A 


7ATGAA 


GGGG ATTACG 


TTTTAGTGGA 


A A 2 ' 3 AAAG C A 


ccaaa 


A33GT 


AT 


3TGTTAG.A 


7.3.A. 


A3 3CTA 


CATCCATTTG 


ATGTAACCGC 


C 3 AATTAGGA 


hAAG.--. 


33AGG 


CA 


ATT 3CTTT 


. : 7 : 3 


GAwCTT 


ATG AATTATC 


AAGGAACTGC 


t:aattaagc 


AAAG.A 


7AA3G 


AA. 


A3A3GTGA .■ 


-.7 73. 


TTGGCA 


GGTGCGGTGT 


TTAAGGTCAT 


73ATGAAACA 




AACCG 


TA 


GATGGACA 


, :,C3 


AATCTG 


ATGTCTGACA 


AG 3AAGGCAA 


A3-, 3 ATTG C G 


AAAAA 


3 77 AG 


CA 




r AA7 


3GTTTT 


GTGGAGACAC 


AAGCGGGAAC 


7AG:TA7CTT 


CTTAA 


'33, AAA 


C3 


3 w ^.^-o _ '^'^ 


3.3 3 


77TACG 


ATTGCCAAAG 


ACAACCAAGG 


caaa:cagcc 


A3TG7 


33TAC 




i.An 3GA 3 3 




377 AAT 


TACCAAGGTG 


CTGCCAAGCT 


G3T3AAAATT 




3 3 AAA 


AG 


; iAiT 33 3TT . 




3 ;tgct 


GAATTTAAAG 


TGACAGATGC 


A 3A3ACAGGG 




— G 


CT 


3GTT3AT7 . 




'73TGAC 


AACCAAGGGT 


7A3TT3AAGT 


GAA - 2 AGTTA 


G A^v3 ,: ., 


AG3AA 


AA 


TATAC3TT ' 




37AACA 


AAAGCACGGG 


ATGGTTACCA 


A3TGTCTAAG 


CAAG'.V 






M T3 ACTA.T 




7 AAC A 


GCGAAAGACA 


AACCTGAAGT 


C3T3AATGCG 


GG3AC 


377TG 


TT 


AACGAGA3\ .• 




3 7TGTA 


TCCAAAAAAA 


CAAAACCAAA 


T3AG3CAACA 


ACGA3- 


A3AAG 


CA 


33TAGAGA ■ 




33TTGG 


CTTGGTTTAC 


CGAAAACCAA 


CACACAAGTC 


AATTA 




TT 


3TCTTTAT 




CTCATG 


TTGGTCGGTT 


TGGCAAGTTG 


G3T3TTCTAT 


AAAAA 


3AGGA 


AG 


AAA T AA 







EF124-2 (SEQ ID NO:462) 



MRKNGPMV NRWLYGLMCL LLVLNYGTPL MALAEEVr 


sc 










GQLTLGEVKQ 


TSQQEMTLAL 


QGKAC7PVTQE 


WVHY:7 


A37VS 


I?' 


AAHWAAPi: 




?::cvddq 


KKQIQIELNQ 


QALADTLYLT 


LNPTATEDVT 


FSYGA. 


A?. AL 


TL' 






^"7^SPAA 


SANEGSTEEA 


STNSSVPF.SS 


EETTA'ASTTKA 


IESKT7 


TT C T"^ 


vt: 


^RYAGPTG 




777GDET 


TIIDNFEDPI 


YLNPDGTPAT 


PPYKFDVTIH 


WNFN^.-.'i 


2 3 ED 


VF. 


TwKKAGDY 




:3?GNLK 


PNKPGSGDLV 


DAEGNVYGTY 


TISE3GTVRF 


TFNEF7 


T3ES 


d: 


iGGFSLDT 




35LGRGP 


GDWVIDIPTQ 


EDLPPWIPI 


VFDTEQC'IDK 


CGHF3 7 


T7r!P 


SA 


:T/;TVDr:: 




?:dctnpt 


VTETOPTGNT 


FKSVKVYELV 


MNL7GTIKEV 


GFEG3 7 


37YT 


VD 


-:ngcttik 


-■ - rp 


MFA.YRLE 


YCTTIDEAVI 


PDGGGLVPFK 


NHATLTSDMN 


PKG73A 


EATV 


TA 


7Y3KML3?; 




DYDEANQ 


EFTWEINYNY 


GEQTT PKDQA 


VITC7 r MGDNL 


TFEP3: 


3ALY 


SV 


7F3DKGME 




AELVEGK 


DYKWINGDG 


SFAITFLHDV 


iga\t:idykt 


KVDG3 7 


"EGDV 


AV 


:NPVDVG7 


3.H 


SET DGTA 


GQQNIIKNTG 


AVDYQNSTIG 


WTLA\ r NQNIJY 


LMENA7 


■:tdt 


YE 


?v?glt:7v 




L7A7KDTT 


TGAQLTLGKD 


FMVEITRIIAD 


GETGFKVSFI 


GAYA7 7 


^DAF 


HI 


7Y7TFFDV 


T AL 


GAMNPAL 


DHYRNTAAID 


WTDEAGNIIHH 


EEDSKPFKPL 


PAFG3: 


IAvKS 


GV 


i'navtke: 




I AYMLSN 


MRLVDAFLTD 


PILTNQTYLA 


gelkvtegnt 


KPDGS7 


'E7-7/K 


PT 


7 P3TDITM 


E3P 


SEKMQNT 


WRVDFPNDSR 


TYAMEFKTSV 


DEK T 7IEGSAS 


YE OTA 7 


VTICQ 


GS 


7.R7VTGKY 




HGGESVK 


KGGEYKKDDP 


DHVYWKVI-1IN 


GAQ'SVLDDW 


I to?:; 


T3GV 


LD 


7E3LVIYG 




TEDGTIT 
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PDKSVILEEG 


KDYTLEVTTD 


UETGQQVIW 


kmah: 


• a?yy 


MEYRSLVTEG 


A 


•.GSTDTVSN 


QVSITGNGSE 


WHGDDNGDV 


WDIEHSGGH 


ATGT: 




kktamdett: 




•.GAHFQIWD 


QAKTQVLREG 


TVDATGVITF 


GGL?C:GC-YIL 


vetk; 


t-'EGYT 


VSDELAKGR\' 


7 1 


^IDEETSAE 


GAQPTI IKND 


VNKVFLEKMD 


EKGK.K LVNAR 


fklei 


V'TTP 


FTLl'EEVPLA 


I ■ 


;rtnangql 


EVDSLKPGLY 


QFTEIEAPTG 


YLLETTPKRF 


IVTC'I 


j-agqi 


RDVI'VKMLNY 


c 


'SALLIKKD 


QAGNPLAGAE 


FSVLDTTGQA 


VREHLVSDAN 


GKV: r - 


7LLAP 


GKYCFVETKA 


I" 


-.GY LLNTEP 


SAFTIAASDR 


GKPATVIATA 


NFVUYGGTAK 


LIKKI 


/hghl 


LSGATFKVLL 


A 


■GETIQTGL 


TTNNQGEIVA 


EHLAPGKYRF 


VETKA.FTGYL 


LNTTJ 


ALFEI 


AEKIIAGKPA^ 




■AEDNFVSY 


KGAFQIVKTN 


SADQPLAGAV 


FELYDHIJKQS 


lg:t; 


TAGKD 


GK_ 7 FRDLAP 


G 


:yyykeika 


PKLPDGADYI 


IYPELVKVEI 


RGCFKGDPEI 


FQLG.- 


LAMFK 


gf a\tkkila 


r 


AJPLPGTIF 


KLYRIENGEK 


IFEREVTAEK 


DGSLAMEDLG 


agsy: 


Li.: ELD 


atlcyivnfc. 1 


r 


TYFV^'KKNS 


NDKQPLDELE 


FVNYQAEVMG 


RKVIJEQGQTL 


AGAVj 


A : YNA 


DELIIQPQGSL 




PFL.KRAGEK 


VSEITTDKTG 


EIYAKGLNEG 


HYVLVETKAP 


TGYLI 


GGGLH 


PFPA-'TAQLGL 




..PIALGDLI 


NYQGTAQLTK 


ENETGEALAG 


AVFKVIDETG 


QTVDC 


;t:jlm 


SDLQGKVIAK 




^APGTYRFV 


ETQAPTSYLL 


NETPSASFTI 


AKDIK'GKPAT 


WLYJ 


PLINY 


QGAAKLVKI P. 




;kwalagae 


FKVTDAETGQ 


TVARSLRSDN 


QGLV0V13HLQ 


pgky f : 


F' ' ETK 


APDGYQLSFC 




AAFTIAATA 


KDKPELVNAG 


TFVNEKQPVS 


KKTKPNQPTT 


KQAAI 


L : oWL 


gllkthtqyi:- 




■FVFJGLML 


VGLASWLFYK 


KSKK 















EF124-3 ( SEQ ID NO:463) 

tgccttccacataacttatactacctttttgacg ga r " a laattt 'aaaccaatca aac ctatttg 
gctgggagct tgaaagtcta tgaaggcaat ac aaa g ;cag atggttcggt tg aaaaagtg 
aaaccaacgc aaccgttgac ggatatcaca atggaa ^\ac ca/.gggagaa a lag gaaaat 
acttggcgtg ttgattttcc taa7gatagt cgta; gtatg tgattgaatt n aagacgtct 
gttgatgaaa aagttatcga aggttcggct agtt; t ~-aca at;.:cgcatc : aatacaaac 
caaggttctt cacgtgatgt gagaggaaaa gttt tattc aagatggtgg gg aatcagtg 
aaaaaaggtg gcgaatacca c aaa g a tg at ccag/t7atg tgtactggca 7 7taa7gatc 
aatggcgccc aatcggtttt agaggatgtg gtta'.'tactg at/.gaccctg a ggaaaccaa 

GTGCTAGATC CCGAGTCATT GGTGATTTAC GGTA .CAACG TAAGAGAAGA G a:;AAGTATT 
ACGCCAGATA AATCTGTTAT TTTAGAAGAA G G AAA JV ^ ATT AC..GACTGGA AGTTACCACC 
G A T AATG AAA CAGGACAACA AAAAATTGTC GTTAAAATGG CC GATATTGA A. A 7AGGTTAT 
TATATGGAAT ATCGTAGTTT AGTGACTTCT TCAG AAJA:GG GGAGTACAGA CA7GGTATCC 
AACCAAGTGT CAATTACTGG AAATGGTTCA GAAGTCGTTC ATGGGGATGA CAATGGCGAT 
GTGGTCGTTG ACATTGATCA CAGTGGCGGG CATG7GA7AG GGACTAAAG3 GAAAATTCAG 
CTGAAGAAAA CAGCCATGGA TGAGACGACT ATTTAAGCAG GC.^GCATTT C'JAAATTTGG 
GACCAAGCTA AAACACAAGT CCTACGTGAA GGTA LA STAG AT 7CGACCGG C- GTTATCACA 
TTTGGTGGGT TGCCACAAGG GCAATACATT TTGGTG3AGA CALAAGCAC7 AGAAGGCTAT 
ACAGTTTCGG ACGAATTAGC TAAAGGCCGA GTCATTA :TA TTGATGAAGA AAGTTEAGCC 
GAAGGAGCAC AACCAACCAT TATTAAAAAC GAT jTCAATA AA VTATTTTT AGAAAAAATG 
GATGAGAAGG GTAAAAAGTT AGTGAATGCT CGCTTTAAAT TA TAG CATC 7 CGTAACCACG 
CCGTTTACTC ATTGGGAAGA AGTTCCCCTT GCG3 .. ; ATC GAAGGAACG7 ^AAGGSCCAG 
TTAGAGGTGG ATAGTTTAAA ACCAGGGCTT TAT 7 ALT TC A CA VAGiATC 3 A A A7AG 7GACA 
GGCTATCTTT TAGACACGAC CCC 7AAACGA TTCA '. ". 7 TGA CA 1 AGvAATA 7 L AGCG 3ACAA 
ATTCGTGATG TTCATGTCAA AATG 3TTAAT TACCAG-. 3 3TT CT-GTGAA7T A ■" TTA.AAAAA 
GACCAAGCAG GCAATCCATT AGC AGGTGCT GAATTTTCAG TC7GTGACA7 LL3AGGACAA 
GC AGTTC GAG AACACTTAGT TTCGGATGCA AACGGAAAAG TCA7AGTGAG GG ATTTAGCC 
CCAGGAAAAT ATCAATTTGT GGAAACCAAA GCGC AAGCAG GGTACCTTTT A.AAGA7TGAA 
CCAAGTGCTT TCACGATTGC AGCAAGCGAT CGGGGCAAAC CALGAACAGT TATAG IAACG 
GCTAACTTTG TTAACTATCA AGGGACGGCT AAATTAATCA AA AAAGATGT C.-AATGGACAC 
TTATTAAGTG GTGCGACATT TAAAGTGCTT GATGGGAAGG GAGAAACGAT ^ 'A'GlACAGGC 
TTGACGACAA ATAATCAAGG G 
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\\VAY. ■ 


Ac j J Scqcuc;-, . 


■ E. 








AF HITYTTFFCY TE'~Ar"^~* 














CHYFNTAAIC 




' "■ 


X — 


'/A.. A 


■■ - v 5 


G'.' 


::-a*tf v ~ 


c: c;c;lsn 


KRCYDAFLTC 


PTITI^TYLA 








": :•".*?: 


?T 


;ttc:t:: : 


" FCEFNQNT 


WRVGFFKCSR 










■ 


G c " 




: . esvk 


KGCEYHXECP 


chvyv:hy>j:k 




y:-g;t,*v 


;cca 


• c~v 


CL ■ 




".'"EC'GTIT 




KCYTCEYCTC 








:;,?yy 


ME . 


CCLVTCC -■ 


" V CCTVSN 


C VS I TGNGSE 






CriSGGH 




- :;l 


kk: 




■,ga:-:f 3IWD 


QAKTQVCREG 














ECAKGRV : 


::: eetsae 


GAQPTI IKND 






C' : ;A 


FKCZ; : 


;,' "ttp 


ft; 


'■:eev?la : 


" ! CANGQL 


EVDSLKPGLY 


QFTEIEAPTG 








..-GQI 


rc\ 


CMAMACY . 


- CACCIKKD 


QAGNPLAGAE 










_ : cap 


G>". 


;;fvetka :■ 


.: : :.:;tep 


SAFTI AASDR 


GKPATVIATA 








" ': :ghl 


lc: 


ATFKVCC ; 


cc; t:qtgl 


TTNNQG 


















EF125-1 (S; 


:g ID NO:465 














TAAAATAAAA 


^aTTGG J.ACG 








TAT 


GT. 




CC CAAGG 


ATGAAAGAAA 






J AAATG 




.' "TT 


GC ." 


L'CTACGG 


" ; \CGTGT 


TTGTTACTTG 


q-im ^rp> * mm « 
* i \v, 1 >-l> J i-rt. 1 i. t\ 




C I A 2CA 






TG 


■ C; CAAGA i 


c: CATAGC 


GATGGCCAGT 






Z 7G AAG 


C/^-i. -.c 


•:. - 1 


AiC 


.^GAA-^T ■: 


"- C C7TAGCG 


CTTCAAGGAA 






A rACAA 




/■"TAG 


tc; 


AT TAT AG r : 


;■ c:\aatgtg 


TCAATCAAAG 






3 CGCC C 


AATAA 




gc;- 


.\GATTL'A .*. 


: rc.TATGAC 


CAGAAGAAAC 




TACA 


CTGAAT 




:,:."'C "T 


TA" 


""-GAT A" 


cr;-. ."tttta 


ACGTTGAACC 




M C 


AYFGTG 


A .: cc 


"CTTT 


AT- 


: AC T AA ?A ^ 


'.ACCGAGCG 


TTGACGTTAA 






- ZGAGA 


* * • 




CAC 


TZACGAG ': 


' CAGCC 


GCATCAGCGA 


ATGAAGGTTC 




AAAGAA 


GC AC ; ' 


:;taa 


ac: 


: ZTCTG r ? '.. 




TC CG AAGAAA 


2TGTCGCCAG 




C/ZAAAA 


gc j,v; 




GT/- 


A.AACA-^C T 


: A/-T JGACG 


ACTGTCAAAC 


CGCGCGTAGC 




TCAACA 


gat;-/: 




ATT 






ACAACGATTA 


TCGATAATTT 


TGAA 


CATCCG 




'"■''^AA 


at: 


C TG AT 3 G ; 


:;;a CAGCA 


ACACCGCCGT 


ATAAAGAAGA 


T3TG 


AC C ATT 


CAT~\- 




TT/- 


actggt: (.- 


\TTC CAGAA 


GATGTGCGAG 


AACAAATGAA 


aj:a 


jGCGAT 


TACCC 


■ "A 2T 


TT" 


.actta r : ' 


'■.r^C AATTTG 


AAACCTAATA 


AACCAGGTTC 




2ATTTA 


gttg; 


TGCAG 


AA:- 


■CCAATGT ■:' 


CC'C'_ -GAACC 


TACACAATTA 


GTGAAGATGG 


TACG 


CTTCGT 


TT7AC 


:;ttta 


AT^ 


■^'-'"'GA^ * 


CCGTCTGAA 


AGTGACATTC 


ACGGGGACTT 


tt:t 


TTAGAT 


ACT :c 


TTT3A 


ATC 


A.'ITCAGA f : 


ZGGZ GGGGC 


CCAGGAGATT 


GGGTGATTGA 


TATC 


:ctaca 


CAAGA 


;-.GATT 


TGC 


:ccctgt ;■ 


■ . 1 CTTCCA 


ATTGTCCCAG 


ATACCGAACA 


ACAAATTGAT 


AAACA 




ATT 


TTG AT TG A 


CGGICCAAT 


CCTAGTGCGA 


TTACTTGGAC 




3ATATC 


AATC A 


• c.:ga 


TG.- 


AAGAT - A A 


'.CACCaTCCA 


ACTGTGACGG 


AAACATGGCC 


AA /A 


AGGAAT 


AC TIC 


"CAAGT 


cc:; 


TCAAACT 


".--.TC AGTTA 


GTGATGAATC 


TTGATGGAAC 


AATTAAA 2AA 


GT AC- 


::~gzg 


AAV 


TTAGTTC ; 


C\TCAATAT 


ACCGTTGATA 


AAAATGGCAA 








':caca 


cc; 


'•. :aaa:i ": :: 


: A' r C GTCTT 


GAGTACCAAA 


CGACGATTGA 


CG AG 


CTjGTT 


ATT A. 


ATG 


GC. 


r jggc jA r : 




AAAAATCACG 


CGACGTTAAC 


AAGT 


CATAAT 


AAT "C 


■CATG 


GG" 


':agatgc 


' •. •'• VAACT 


GTTACCGCCA 


CATATGGCAA 


AATG 


TTAGAC 


AAG A~ 


. TATA 


ta: 


.CTTACGA 


CA CCAAT 


CAAGAATTCA 


C TTGGG AAAT 


T.AAC 


TACAAC 




C/AAC 


AA.- 


::attcc 


"AAA- ACCAA 


GCAGTCATTA 


CAGACACAAT 




"ATAAT 


TTAA' 


"■"'TTG 


AA' 


:agattc r : 


CC/-. ATTTA 


TATTCAGTGA 


CATTTGATGA 


CAAA 


3AAAT 


GAAC: 


:t-ttg 


GA " 


" -GAA "T : 


"■ AAGGA 


AAAGATTACA 


AAGTGGTAAT 


C Ajs '.J 


jGAGAC 


GGTTC 


"TTTG 


CA.--- 


TTG AC TT : 


cta: atgat 


GTGACTGGCG 


CAGTCAAGAT 


TG ATT AT AAA 


ACC^ 


AC TTG 


at:- 


GAATTGT C 


^AAGGCGAT 


GTTGCCGTGA 


ATAATCGTGT 


GGAT 


GTTGGC 


A'JTCC 


TCASC 


ATC 


:a3Aaga c 


•ac:gcaca 


GCCAGTCAAC 


AAAATATTAT 


TAAAAACACT 


GGTG : 


.-■.GTTG 


at: 


ATGAA.AA C 


C CCA CG ATT 


GGTTGGACGT 


TAGCTGTGAA 




AATAAT 


TATTT 


VATGG 


AA.-- 




CC/.CGGAT 


ACGTACGAAC 


CAGTTCCT3G 




ACTATG 




; :. i r p r jp 






C-AC7JATACA 
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ACCACTGGTG 


CTCAGTTGAC 


GTTAGGCAAG 


GATTTA 


AT AG 


TAG A 


aataa: 


T 1 




•■.ATGCA 


GATGGTGAAA 


CAGGCTTTAA 


GG TAAGTTTT 


ATAGG ' 


a a at 


ATGA 


:aaaac 


A .■' ■. 


AT 


AATGCC 


TTCCACATAA 


CTTATACTAC 




"TCGAT 


gttaaa 


AA AT 


TAG A 


A 3CTAA 


CAA 


a:a 


AATGCG 


TTGGACCATT 


ATCGAAATAC 


CGCTt 


A A CAAT T 


G ATTG A 


a a ag 


ATGA 


A 3CAGG 


A. A. 


AC 


A ATC AT 


CATTCAGAAG 


ATAGTAAACC 


GTTTAAACCT 


tta : ■ : 


-ATT 


TTGA 


TTTAA-^ 


1A 


AC 


: A '.AAAA 


AGCGGTGTTT 


ACAATGCCGT 


CACCA 


vAAGAA 


ATCACT 


aaga 


CGAT 


TGCGGT 


T, 




A.aAGT 


AATAATCGTT 


TAGTCGACGC 


CTTTTTGACG 


GAT aaaa 


AATT 


TAA : 


A. AATC A 


AA- 


AC 


TaTTTG 


GCTGGGAGCT 


TGAAAGTCTA 


TGAA< 


jgcaat 


ACAAAG 


aaag 


atga; 


TTCGGT 


T^ ■ 




a.XAGTG 


AAACCAACGC 


AACCGTTGAC 


GGAAA 


.TCACA 


ATGGAA-. 


aaac 


CAAG 


CGAGAA 


AA. 


A. A 


AAAAAT 


ACTTGGCGTG 


TTGATTTTCC 


TAAT 


GATA.AT 


cgta: . 


:atg 


TGAT 


TGAATT 


t r 




a :gtct 


GTTGATGAAA 


AAGTTATCGA 


AGGTA 1 AGGCT 


AGT i ,, . 


- ,AA 


ATA : 


AGCATC 


T 1 : 


A'] 


'. _'AAAC 


CAAGGTTCTT 


CACGTGATGT 


GAGA' 


A AAAA 


gtt::: 


ATTC 


AAC A. 


TGGTG A 


c^ 




' AGTG 


AAAAAAGGTG 


GCGAATACCA 


CAAA, 




CCA";-/: 


aatg 


TGAAA 


:tggca 


i''- ' 




ATG ATC 


AATGGCGCCC 


AATCGGTTTT 


AGA'A 


;atgtg 


GTTATT 


ACTG 


ATA 


-■^cct;: 


A 1 


A'A 


A ag CAA 


GTGCTAGATC 


CCGAGTCATT 


GGTGATTTAC 


GGTAA-: . 


AAGG 


TAA 1 : 


AGAAGA 


C A." 


AAA 


ACTATT 


ACGCCAGATA 


AATCTGTTAT 


TTTA 


AAAGAA 


G GAAA. A 


A ATT 


AC A A 


ACTGGA 


AG 


TT 


a.GCACC 


GATAATGAAA 


CAGGACAACA 


AAAA. 


VITGTC 


gttaaa 


AC AG 


CCCA 


T ATTG A 


a: ■ 




"GTTAT 


TATATGGAAT 


ATCGTAGTTT 


AG TO/ 


gattct 


tcagag 


aggg 


GGAG 


TACAGA 


C;-. 


CG 


A TATC C 


AACCAAGTGT 


CAATTACTGG 


AAA'}' 


; ATTCA 


gaagta 


attc 


ATGA- 


:gatga 


c:-. 


A.T 


a A CGAT 


GTGGTCGTTG 


ACATTGATCA 


CAAT 


; acggg 


cat::: 


a:: ag 


gga: 


T.AAAGG 


c: 




■TTCAG 


CTGAAGAAAA 


CAGCCATGGA 


TGAG. 


...:gact 


atatta 


aaag 


GCG A 


AC ATTT 


C : . 




ATTTGG 


GACCAAGCTA 


AAACACAAGT 


CC AAA 


1 ATAAA 


ggta:a 


• : ag 


ATG - 


A'ACCGG 


G- . 


:t 


a. TCAC A 


TTTGGTGGGT 


TGCCACAAGG 


GCAA 


AAC ATT 


TT A ATA 


AAAA 


CAAA 


A 3CAC A 


A: 




a ;CTAT 


ACAGTTTCGG 


ACGAATTAGC 


TAV-- 


ACCCA 


gt~v»t- 


a TA 


TTGA. 


A 1 3 AAG A. 


AA 


".' A 


A AAGCC 


GAAGGAGCAC 


AACCAACCAT 


TAT A. 


A .AAAA C 


gat at a 


A ATA 


AAGT 


ATTTTT 


j~> ■ - 




AAAATG 


GATGAGAAGG 


G TAAAAAGTT 


acta 


AATGCT 


cg:ttt 


a: AT 


TAG A 


AAATGC 


C'- 


TA 


A -CACG 


CCGTTTACTC 


ATTGGGAAGA 


aata" 


:_CGTT 


gca::g 


A ATC 


GAAA 


:;^acgc 


g: 


::i' 


AGCCAG 


TTAGAGGTGG 


ATAGTTTAAA 


ACTA. 


AGGCTT 


TATCAC 


err a 


CAGA 


A ATC G A 


Ai- 


A' ^ 


:.: :gaca 


GGCTATCTTT 


TAGACACGAC 


CCCCAAACGA 


TTCATC" 


ATGA 


CACA 


AAATAC 


GA- 


AC 


AGACAA 


ATTCGTGATG 


TTCATGTCAA 


AATC- 


:ttaat 


ta:caa 


a gtt 


CTGC 


TGAACT 




TT 


a„AAAAA 


GACCAAGCAG 


GCAATCCATT 


AGCA< 


.-■ j'l'GCT 


GAATTT 


t:ag 


TCCT 


TGACAC 


CA 


A .A 


A'AACAA 


GCAGTTCGAG 


AACACTTAGT 


TTCC- 


AATGCA 


AACGGA 


A AAG 


TCAC 


AGTGAC 


GA 


.-A.T 1 


T AAGCC 


CCAGGAAAAT 


ATCAATTTGT 




ACCAAA 


gcj::^ 


J AAG 


GGTA 


A ATTTT 




A- A! 


AATGAA 


CCAAGTGCTT 


TCACGATTGC 


AG-J/i 


* aL'GAT 


CGGGCC 


■V«AC 


CAGC 


A AC AGT 


t: 


AA 


A AAACG 


GCTAACTTTG 


TTAACTATCA 


AGAA 


a agact 


aaatta 


A": A A 


AAAA 


AGATGT 


g: 


A.T 


A GACAC 


TTATTAAGTG 


GTGCGACATT 


TAAA 


a agatt 


gatgag 


,AA AG 


GAGA 


AACGAT 


T 1 


A A 


A AAGGC 


TTGACGACAA 


ATAATCAAGG 


GG A A 




gcagaa 


a a at 


TAGC 


a:cagg 


A' 


"'.A: 


A.ATCGC 


TTTGTAGAAA 


CCAAAGCGCC 


aa:a 


A ACTAT 




A. ATA 


CCAC 


A :CAGT 






TTGAA 


ATTGCTGAGA 


AAAATGCTGG 


TAAA 


::agcg 


gtagtg 


ATTG 


CTAC 


T AAC AA 


c: 


TT 


A- AG AGT 


TACAAAGGGG 


CTTTCCAAAT 


CGTG 


AAAACG 


aata aa 


AAAG 


acc; 


h :CATT 


A 1 " 




A GTGCT 


GTTTTTGAAT 


TATATGATCA 


CAATAAACAA 


tcatta 


AC-GA 


TTAC 


A 3CAAC 


G" 




A ^CAAA 


GATGGCAAAA 


TTATCTTTAG 


AG AC 


ITGGCG 


ccaggt 


ACCT 


att; 


TTACAA 


AA 




AAA CAAA 


GCACCAAAAT 


TACCAGATGG 


CGCA 


j ATT AT 


attaat 


TATC 


ctg; 


ATT AGT 


A : 




G AAGAA 


ATTCGTGGTG 


ATTTCAAAGG 


TGAT 


.: aggag 


at': t a 


AAAT 


tag: 


A GCCTT 


c: 




AATTTC 


AAAGGACGCG 


CCGTCTTTAA 


GAAA 


ATTGAT 


GCC£A~ 




acc: 


A A TTCC 


A 




a, :gatt 


TTTAAATTGT 


ATCGAATCGA 


AAAA 


AGGAA 


aaaaaaa 




AAAA 


■AGAAGT 


A- 




G -TGAA 


AAAGATGGTT 


CATTGGCTAT 


GGAG 


AATTTA 


ggtcct 


ACTA 


GCT; 


:i 3AATT 


A' 




AACTG 


GATGCAACGG 


ATGGCTATAT 


CCTA 


A AT AAA 


CAACAG 


ATTT 


ATTT 


TATAGT 


G.- 




A. AGAAT 


TCAAATGATA 


AACAACCACT 


AGAA 


l-AGTTA 


GAGTT ; 


AAAA 


ATT7 


.TCAAGC 


AC 




AAAATG 


GGACGTAAAG 


TCAACGAGCA 


AGGT 


I'AAACC 


ttaca: ' 


•• : AAG 


caga 


"ATTTGC 


Aj 




TACAAT 


GCCGATGAGC 


AGAATCAGCC 


CCAA 


A ATTCA 


ccgat; 


A A AT 


tct: 


■GAATCG 


T: 




g -AGAA 


AAAGTTTCTG 


AAATAACAAC 


GGAT 


A AG ACT 


ggagaa 


--rprp 


ACG" 


"T-vAAGG 


g: 




a. ATGAA 


GGGCATTACG 


TTTTAGTGGA 


AAC AAAJVACA 


CCAAC- 




atc: 


"AT TAG A 


c: 


.A-V.AGCTA 


CATCCATTTG 


ATGTAACCGC 


CCAA 


f JTAGGA 


AAAGAA 


:agc 


caa: 


-TGCTTT 


AC 




GATCTT 
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TABLE 7 Nucleotide and A::.;:. .\.\\ : ...gv Genes 



GGTGCGG7GT 77AAGG7GA7 : • " . GGG GA G G - ~ GG 7A GGG GAG A- A. ATGTG 

3TGGAGACAC AAGGGTGAAC GAG TGATGG'T GG"G; G.AA G 3 ": "AAA 3 "GG A ■-. ""TTACG 

attgccaaag a:aacgaagg gaaggpaggg a: : ; tag 77gg aggagg i":"r: ggttaat 

TArCAAGGTG C7^:GAAGGT GGG .AAA ATT GAT AAA AGGGGGGGTG AG, . GTGCT 

GAATTTAAAG TGAGAGATGG AGAGAGAGGG GAG—-. GGG GT ... G TGAGT- ~. . ... . GGTGAC 

AAGCAAGGGT TA3TG7GAGGT GGG ; G G ' TTA G.GGG.. GA_A AAGATAGGTT "G - -AAACA 

aaagcacggg atggtgagga agtgt: TAAG GAAG GG 3 GA"GGAGTAT T.:l. GGAACA 

GCGAAAGACA AAGGTGAG.GT GGTGAG7GGG ZZ:;.' r ' TT3 TTAAGGA3AA A 7 G GGTGTA 

TCGAAAAAAA CAAAA-. AAA. A TGA, 7-AGA A" G- • GA G CAGG r GAGAGA GG G GTTGG 

CTTGGTTTAC C 3AAAAGG AA CAGA GAAG7C AATGG-. : TGT TTGTGTTTAT G "G-G "TCATG 

TTGGTCG GTT TGGGAAGGTG GGG GTG I* TAT A-AAJ, ■ GGGA AGA_AATAA 



GF125- 



i 3 EQ IT ::G : 4bc ; 



MRKNGPMV MRWEYGEM 
GQLTLGEVKQ 
KKQIQIELNQ 
SANEGSTEEA 
TIIDNFEDPI 
PNKPGSGDG7 
GDWVIDIPTGJ 



LLVGG' 

TSG^'EHTLAL G;GG. 

QALAUTLVLT ENP' 

STN3 r P KSG GG';"' 

YENPGG7PAT PPV: 

EAEGIC/YGTY GGG: 

ECLPF^ZPI VPE' 



MAG A 

;:e V" 



:h wp: 



GK C 



vtetwptg::t fksvkvyelv 
yqttideayi pdgggevpff 
eftweiny::y gectipkpqa 
dykwingpg efaitglhdy 
sqqni ikntg avey'p-gtig 
tgaqltegkd fmve i g'pggag 
dhyrntaa1d wteeagnnhh 



NRLVDAFLTD 
WRVDFPNDER 
KGGEYHKDCP 
PDKSVILEEG 



pilt:;t7yla 
7yyieff7sy 

kdytleyttd 



QVSITGNGSE WHGDGNGDV 
QAKTQVLREG TVDATGVITF 
GAQPTI IKKD YNKVFGEKMD 
EVDSLKPGLY QFTEI EAPTG 
QAGNPLAGAE FSVLDTTGQA 
SAFTIAAEDR GKPATVG ATA 
TTNNQGEIVA EHLAPGKYRF 
KGAFQIVKTN SADQPLAGA'/ 
PKLPDGADYI GYPELVKVET. 
KLYRIENGEK IFERE'GTAEK 
NDKQPLDELE FVr JYC AEVXG 
VSEITTDKTG EI YAK GL'JEG 
NYQGTAQIjTK EIIETGEALAG 
ETQAPTSYLL NETFSASF77 
FKVTDAETGQ TVARSLR5DIJ 
KDKPELVNAG TFVNERGPVG 
VGLASWLFYK KSKK 



1^ I- 
GShr" 



;pl FA 



ueg'ggg; k gw km; 
vtgtgggggh atc 
gglpqgqyil ye: 

EKGGGGAT^AR FKG 

vreheygdan ge* 
nevg :gg -tax eg:- 



ft: 



FELY GH NF n 3 
RGEFGGGEPE I 

EKTMEGyGGpTE AG/ 

HYGGL7E7EA? TG' 

aveggmeetg £7"' 

aktgggkpat vv: 

Cgltqynhlq pg: 

k>gt?'?;;g,?tt k;..- 



'G/5 :k . 

:-AL 7LG7' 

GTT YKG i 

GED VRG7 

PUP sa:t' 
eyt ydgg 
atv tagg 

GLY SY7F 

ggy ay:g: 

TGT YEP'" 
GAF H I TG 
7>KS GVGG 

gg/x pt;g 
■t:cq gsgg 
:-;rv lege 

?Y Y MEGG 

t;.-l kkt;, 

GYT VSCE 
TTP FTHV: 

■Gvi rig::-: 

.GAP GKYG 

:ghl lsgg 

FEI AEGG 

g?:d gk:: 

GIEK GR,/ 
ELD at: 
TiTIA DEC.G 
: TEH PFGG 
TILM SDGC 
:riY QGAGA 
ETK APT G 

. g.;l glg- : 



iWAAPI-: 

: AG PTE 
GKA3TY 

:fsltt 

■GrGDIG T 

j itgtig: 

:;kmld>; 
";gfgxe 



gtffgt ti 

^VTKEi T: 
GTETTM E; 
GYTGKV S . 

slyiyg t: 

3LVTSS A, 
GEETTI L, 
GAKGFV I' 
EEVPLA PI 
'TaOLKY Q; 
TYETKA P, 
TFKVLE A: 
AGKPAY \" 
GRDLAP C 

g?:kii a IG 

YIYN?'v P 
■^PQGSP I' 
TAQLGG E: 

gkvi;^. IT 

TLVKID G 1, 
YQLSKQ A 1 
TNTQYK Y 



GGYDDQ 

attsspaa 
: . : : gdet 

G, GP3NLK 

::: ^ gggrgp 
gg g;tnpt 

"A AG/RLE 
: GSANQ 
I / 'EGK 
G GGDGTA 
:g: " "VKDTT 
::g- aijupal 

: iG GVIJLSN 

:ggeknqnt 

7 G GGESVK 
r/TGDGTIT 
.GGTETVSN 
GAAliFQIWD 
-GEETSAE 

-rgt:angql 

••: GIKKD 
,GY GLNTEP 
'.GETIQTGL 
'A- GriFVSY 
G GGEEIKA 
GG GPGTIF 
:.YF ■ "Y'KKNS 

:gggeagek 

G- : AEGDLI 
.GGGTYRFV 

g-.ggagae 
;g-g:aata 

:• . GGLML 
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TABLE 1. Nucleotide and Air.inu Acid ScqeixiA • of E. faecal is Genes. 



TAACTTTG TTAACTATC A AGG AAA AGCT AAA77AA . 'A AAAAAGATGT GAG77 
TTATTAAGTG GTGCGACATT A AAAGTGCTT GAT-..-.-. J^GG GAGAAACGAT TG. 
TTGACGACAA ATAATCAA3G GGAAAATTGTT GAAA- . . :ACT TAGACCCAGG AA-_ 
TTTG TAG AAA CCAAAGAGAC A^CAGGCTAT TATA TV ." AATA CCA "GCCAGT C<7 
ATTGCTGAGA AAAATGCT3G TAAACCAGCG GTCC' r : A ^TTG CTAGTGACAA CT r 
TACAAAGGGG CTTTCCAAAT :GAGAAAAA-gg A. A A A 5CAG ACCAACCATT AG 1 
GTTTTTGAAT TATA TG AT - A 7AATAAA A AA TCAT'A" AGGA TTACAGCAAC GA: 
GATGGCAAAA TTATCTTTAG AGAATT3 3CG CCAG '"" \CCT ATYATTACAA AG. 
GCACCAAAAT TACCAGAT3G "AAA AG ATT AT ATTA'A" TATC CT TAATTAGT AA 
ATTCGTGGTG ATTTCAAAGG T 3A7CCG 3 AG ATTTAAAAAT TA Z :;GGCCTT CG 
AAAGGACGCG CCGTCTT7AA GAAAATT3AT GCCA, v ACGA AC Z AACTTCC AG 
TTTAAATTGT ATCGAATCGA .AAA:GGG3AA AAAG7A TTTG AAA 3AGAAGT AA 
AAAGATGGTT CATTGGCTAT G3AGGATTTA GGTGACGGTA GCTATGAATT AG 
GATGCAACGG ATGGCTATAT CGT 7AATAAA CAACAA ATTT ATTTTGTAGT GA 
TCAAATGATA AACAACCACT A 3A A: AGTTA GAGTAA 3TAA ATTATCAAGA A A 
GGACGTAAAG TCAACGAGCA AGGTGAAACC TTAAA AGTG CA ^TTTTTG Z A A 
GCCGATGAGC AGAATCAGCC AAAAAGTFTA ZZ G A T A AC AT TCTTGAATG3 TA 
AAAGTTTCTG AAATAACAAC GGATiAGAAT GGCTA.ATTT AC:,ATAAAG3 GA 
GGGCATTACG TTTTAGT3GA A AT." 3AAAG ..A CCAAA GGCT AT3FGTTAGA CA 
CATCCATTTG ATGTAACC 3C CAAATTAG3A AAA: .. ;AGC CAAATGCTTT AA 
ATCAATTATC AAGGAACT3C TG A ACT A A AC A A. AG.A ■ AACG AAA AAGGT3A AA 
GGTGCGGTGT TTAAGGTCAT T 3 ATGAAAC A GGA ,.• iCCG TA3ATGGACA A .A 
ATGTCTGACA AGCAAGGGAA AGPTATTGVG AAA/-. A TTAG CAA3GGGAA: GT 
GTGGAGACAC AAGCGCGAAC TAGCTATGTT CTTA;a ^AAA CGA7AAGCGA AA 

attgccaaag acaaccaagg caaaacaaac actg7 '" 3tac tt.a aagcac - 
taccaaggtg ctgccaag:t g cagaaaatt gatca :aaa agaatgcctt aa 

GAATTTAAAG TGACAGAT3C A3AGACA3 3G CAAACTGTCG CTC 3TTCATT AC 

AACCAAGGGT TAGTTCAA3T G AA A AACTTA CAA ' A ;GAA AATATACCTT TG 

AAAGCACCGG ATGGTTAC -A AATATCTAAG CAAG A : GTCG CAT TG ACT AT TG 

GCGAAAGACA AACCTGAACT Z 37 AA7ATACG GGCA A TTTG TTAACGAGAA AC 

TCCAAAAAAA CAAAACCAAA TC^ ACCAACA ACGAA.-CAAG CA A A TAG AG A GA 
CTTGGT 



TA77; 



.7AACAC 

AACAGGC 
.ATATCGC 
AA/TTGAA 

A" :TGAGT 

aaggtgct 

APGGCAAA 
.AATCAAA 
,A: CAGAA 

_aaaatttc 
aaacgatt 

• A >CTGAA 
A7GAACTG 
AAGAAT 

,A. ;taatg 

" "ACAAT 
AA AG AG AA 
AAAATGAA 
AA ACGCTA 
A AATCTT 
A ATGGCA 
AAATCTG 
■ 7AAGTTTT 
AA7TTACG 
TTAAT 
vGGTGCT 
^TGTGAC 
AAAACA 
AACAACA 
vCCTGTA 
, AGTTGG 



EF125-4 (SEQ ID ICO; 4 68) 



NFVNYQGTAK LIKKDVKC-HL LSGATFKVLD 
TTNNQGEIVA EHLAPGKY P.F VETKAPTCYL 
KGAFQIVKTN SADQPLAGAV FETYDHI IE CjS 
PKLPDGADYI IYPEL-VKVEI FGT7FGCIEI 
KLYRIENGEK IFEREVTAEK L GAL AMEL LG 
NDKQPLDELE FVNYC'AEVTIG KPAATEQGCTL 
VSEITTDKTG EIYAEGLEEG KY* AT 'ETt 'AP 
NYQGTAQLTK ENETGEALAG AA A PA ' I A ! 'TG 
ETQAPTSYLL NETPSACFTI A.:*"_A A, ■■.'=}' ]' .A7 
FKVTDAETGQ TVARSLRP-DN OZ-ZV A7ATIGQ 
KDKPELVNAG TFVNEKQPVS AA'A- A aA^ATT 



AKGL7 

LVZlT: 

lgit; 
ftgg; 
agsy: 

AGA^'l 



PC 7 
ETA 



■QTGL 

"PFEI AF? I-jAGKFAV 

ASGKD GYZ ZFRDLAP 
;JSTFK GFAvFKKIEA 

.GELD ATAGYIVT^Q 
I YNA DIU^^PQGFP 
TTLH PFP7TAQLGK 

; TNLM SDKTGKVIAE 
TITTY QG AAKLVF] D 
"'ETK AP7GYQLSPQ 
AGWLG 



P.uNFVSY 
AYKEIKA 
P7PGTIF 
r T,T<KNS 
LMRAGEK 
7ALGDLI 
PTTYRFV 
: 'AAjAGAE 
ATIAATA 



EF126-1 {SEQ ID I JO 4 6S 



TAGCGAAAGA AAATAGGGAG CAAA^A.TAAATG TTT." 
ATGGTGATTG TCGCTGG.AAC AG77GTGG 3 A AAT'I 
GAAGCGGTTA AAGCAGGAGA TACAGAAGGA ATG/ 
AGTCTGGCTG ATTGTAAACG GA7A7TG3AA GGA Z 
GAAACGGAAC CAGTCGATTT AG7AGTT3TT GA_A: 



■iAAG C AA C G AAA T T ATTATCGACA 
AGTC CCACATTGGC T'l'T'AH CTGAA 
:\ATA CGG7GAAAGT GA7.AGACGAC 
ACTA CTTTCCCAGT TCA.AG TGGGT 
'3CTA GTGGTAGTTT T A AAATAAT 
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TABLE 1 \uc"eo*'.l" - : 












hii - 


















./^aG 






G A A 


A A.GGGC 


gtgatgctg ; gtg : at; "i' ; g 




■GAGGG-. 






■-TGT 


*7y y ■ - - 


"GGAG'AG 


AAG 


AAG AAAA 


attaattca:; gtgagG/Gg.a 




G-. : .- AG 








GG .A 


■.ATTA^ A. G 


GA 


A AAAA 


AGCCAATTT G TCCCCG.. A G : 




G-. A A A T 






;atg 


GTG-' 






GGCCA 


G< j.hT?' jA"j".' A CG^G. . . ~. -.*-. 




A ' .-AG. T 






AA " G 




.TTTAG'. G 


GA - 


" A .AAAA 


A G G T A * * v : _ G G . TT v . -. ~ 




: " 


a : ■ 


: gg 


■AG AG 


GTG' 


AAGATGG 




G TGCAT 


AAGAG GAAGA CCAA/.--A- AG 




AAA ".AA 




. , , 


-/-iT - 


GAG. 


.-AG/*- - . _ 


r- 


. GAAGTC 


TGAGTGGAAT AGAGTAG.CGA 


AG. 


A. A. A AT 








AAG' 


, i'/-. j-....' 


G A A 


aaa:caa 


GAAATTA GTA AG G A.A G G" G GA 


A 




;ga 


AG ' 


, .-iTT 


GGG. 




G~ " 


AG GTTTA 


AGTTCAGTGA ATTCATACTT 


^ G ~" 


A A.-. TAT 




AA 


AAA 3 






G.. 


a. AAACAA 


GAG TTG C AA 'J AAG G G G G ' T A G 








- - - 


"-.TTA 








A GATGAT 


TTTACAAGCC AATTAAAACA 


ag.g a 


' J. ~ A 




GG 


:tgg 


G i A 'A 


GAT G GAG 




A : AAlACA 


GGTTCATTAA GGATT G * G AA 


G GAG: 


"TTG. AT 


AG 


AG.A 


, :gtg 


GG.A 




G ./. -. 


ATGCT 


GGAAATG ATG 7GC G 7 A G G 1; : A 




AAA,. A A 




'J: 


. - /rrT 


GAG 


G .AAG TAG- 


T- ■ ' ■ 


. AAAGGT 


TACGTAGGAA AGATG A G/AG 


~ y .\ - 


GA A AAAA 




"AAA 


- "AAAA 


AG A. A 


GAG GG- AT 


T '■ : 


' -AAGCA 


ACCCTTGTAA GTAGT GGG.C 


a a . g 


AAG AAA 


-G 


A A 


-. '-.TTG 


GTG- 


AGGAA.TT 




. .AAAGCG 


ACGATTC :TA AAAAT GG^G" A A 


. G . - 


AAG G 




;ga 


:TGA 




. .--1 _t AA 1 


i ■ 


.GGATT 


AG AAAAG A T A T GG AAAA T C A 


AG.-' -.A 


. " A. G TT A 




"T" 


-. ACCA 


ATC- 


G-TGAAGA 


TA-. A 


GTTGGAT 


tggcatgtga aaaca.-ggtt 


; ': : • 


A A AGAAA 




GA./ 


ACTT 


GGA 


"GG GAAGC 


C AA- 


"AA 'TGGTG 


GATGACATTA ATAAA ;T g:T 


A GAT 


~ ■ i ~ ^ 




AG r " 


AAAG 


T'A.A 


."GGAG AA 


AA A 


A AG TAAA 


GATGTTAGAG CTAkC TGAC 


A.G GGA 


AGAGAA 


G / 


GAA 


,"AGA 


A.AG' 


G.AA GTTT 


G A- : - 


A A.G GAAC 


AAA C AAG GAG A G A G G T A T G A 


■ - ;. ~ 


AG A AG G 




gg; " 


~ A GGT 


AAAA 


AAT-G/AG 


GG " ~ 


AGGACT 


AAAATTAAAA CTGAG.GGw.G 


GGAG 


GAAGAA 


TT 


ag ; 


:gtt 


AAA 


GT GAA GA 




GG GATT 


CGCAACCAAG CCG ACTA AAA. 


A'G'A'A 


GG AAA AT 




AG " 


AAGG 


T AA' 


AAG ATT A 


GA ■' 


. .A A.AC C A 


AGCGTAAGAC CACCGGCAGT 


GGA.G 


A CAAAT 




GG '.' 


. AAAG 


A AG 


GAGAAGG 


A '-■ 


A AC AT 


TT AG ATTT AA C G AAG " G ; G G A 


r : C.A.A 


AAATTT 


••-A. 


ATA- : 


; AAGG 


TAG 


AAACA G A 


T . . 


A.:TAAC 


GAAACAAGCA CTTGGAGG GA 


.\ 3 ■ ■ 


. . ■ A~G 




ag; 


A, AG A 


TTA 


•-.TAAA G'l 


C- 


A A GATC 


ACTGATGTAA AAGTGAGAGA 


GG .-GA 


AATGGT 


.A A 


\g;. 


. A TTA 


CAG 


GTAAG A"; 


G ■'■ "■ 


A G 'GAAC A 


C AAG AAAAT A AGAAA;AGAAC 




'A. A AAAG 






: AAAG 


CGG 


AGAGGTA 


"*' : i. 


- "ATTTA 


AGTGGTCATA CGTACAGAAG 


GA GG 


^ j. 




AAA. 


,ATCA 


AAG 


ATAGGGG 


A- A 


GAAGGAA 


GAATTAGGAC CTTATATTGA 


AG - : - A 




A ' ' 


GGG 


AAACC 


AAG 


GGG ACTT 


g ;■.;■. 


A'GT TGGC 


AACGAAGGTG AGGTGTTG GA 


GGGG 


~G-.G -AAA 




AA 


~ GTAA 


CAG 


G AC GTG G 


a 


AACGCCA 


GAAGATCCAA C3ATTAGAAA 


AG .AT 


;ggggaa 


G 


GC/A 


A GAAC 


ATT 


TAG ATTT 




GAACCGT 


G A C C AAG A AT T T AAA T G . 5 A A 


A AAAA 


AAA AC A 


A" 


A'TA" 


AST A 


AAG 


AAA C AAG 


GA" 


A AG AC C 


CAAGCCAGCA TGGTG GATGA 






G" 


GT" 


A A AC A 


T G -A 


GAGACGT 




A.GTTNCT 


G ANG AAAA TG GCAAAGATGT 


i r\ - , /-'. 


AAA T A AT 


" 


- — - 


A ATAA 


"-^ - 


.AAGAA.AA 




G .AAGTA 


ACTTTTACTA TGAACAAAAA 


GG-T 


G- *" pier 1 


TA 


GT " 


AGACT 


TAG 


GTGGTGA 


TA. A 


AGAAGACA 


ATGAC TATTA CCACTAAAAT 


TAAG-. 


AGTGAG 


j _ 


A-AC 


AGATG 


MG 


A. A TTA G G 


G"A 


TTATATT 












' " "TTG 


GG : - 


A G G AA A G 




' : TGTTG 


CATTCCAACA AGCCAAGCGT 


/"S-A ' - / *- 




g: 




AACGC 


CAG 


A.AG AC G G 


A.A.: 


.A A„AACCT 


GAACCTAAAn A Ippp "T'*i A A 
ACGAACCCAG TCAATTTTGG 




A A^ A AAA 




GTG 


GACGC 


CT -A 


CAAAT G A 
ATTTA GG 




AGCACCA 
• a.A'TAAT 














A " 


ACAACAGTAA ATCCAGCGGA 


"A A ' 


.ATGAGC A 




GAG 


A A TTG 




TGATAGT 


G ; ' ■ G 


TATTAGC 


TTTGGCATAA C AAA AAA TAJ* 


AAAA 


AGAAGtA 


AA 


GTA. 












EF126-2 (SEC IC N" : 47C 




















MF KKATKLLSTM VIVAGTA- 




GGGLALAGG 




A. AG DTE GK 


t::tvfv"k 


GG G' 




LADCKRILEG QATFPV;AGE 




GGGGGAE 






~.;dnf 


P H " ' 


. A G G A A' 




: GQ DRV 


MLASYRGGKQ FMFFDGXTKI 




YCAGGGR 






AGDK.S 


CFV 


GGFGEVR 




AGPTAPG 


LKLALDTYIJQ THGDLTKRKT 










"-GLHK 


TKG 


; GG* I NEA* 




: G LQVS 


VEYSNDYQGA AAEVLAGGQE 










A'GSLS 


sv: 


GYFDKYK 




■•PFVKQE 


LQQGSSTPED FITSCGS G GDF 




; G ' A-'D 






■ -"'ATA 


SLA 


G.AGACF'DI 




GATDDAG 


NDVPVQINGQ TISATGTEGY 










' GAAT 


lvg 


GGTIGTAGG 




A GPEAT 



WO 98/50554 : : VUS98/08959 

TABLE 1. Nucleotide and Avr.v.w Acid Seqeucnce.- of EJaccalis Genes. 



IPKNDNAHAC DVTPEDPOIT EE:EE;EKEE 

DINKVLDIID VKVTDEN AKD VO AI IGTVTQE 

IKTDATDEEL APYIEQGGIP EIGE '.ENFGNZ 

DLTNRDQEFK WNVKTAF 3NE OEOAATAAGMI 

ENNKVTFEMN XQADSYDGES GATATE10ITT 

EGDVLHSNKP TVTPPAPTPE EAIE ""KOI EG 

ASMVDDINKV LDITDVKAAIX EEOEEVTDNG 

TITTKIKTDA TDEELAFYIE Q 'AG I PNOAOE 

PKQPLKPKKP LTPTNHQAPT IIAVGEGEGA^ 
GITKNKKRKN 



eteeea afdw hvktafgnet so'. "i". asmvd 

: :ink qat sydylsg hooa'etittk 

geoke. ekpt vtppfvdpni ae: -agqehl 

dee git dvkvtdengk e^o a gkvtq 

ige-g oee lafyieqgg: pe g : emfgn 

C fee; ::rd qeekeinvkta fa: . etwtq 

i ae a: :kvt ftmnkkddgy s y i ••• etytm 

EFGEOEGEVLH SNKPTVTPPA PTEEAPEKPE 

y, r. ::• eont tvnplymiag eg : : iaisf 



EF126-3 (SEQ ID NO: -471) 



TGAA 

gaagcggtta aagcaggaga 
agtctggctg attgtaaagg 
gaaacggaac cagtcgatot 
tttccacatg taagagaagg 
gtgatgctgg cttcatat ig 
attaattcag ctgattat3a 
AGCCAATTTG tctctggttt 
ggattgaaac tcgctttaga 
acgtatttcc tattagggai 
aagaccaata ccaatgattg 
tcagtggaat atagtaat3a 
gaaattacta accaagg ata 
agttcagtga attcatactt 
gagttgcaac aagggtctag 

TTTACAACCC AATTAaAAGA 
GCTTCATTAA CGATTGCCAA 
GGAAATGATG TGCCTGOTIA 
TACGTAGGAA ACATCACGAT 



t a a. a o a. a. ;aga ate a-: a ata 
gataot3 3aa gga ia a acta 
a atagtt jTT gaaia ;ta 
gaoigatgaa gtggtggaag 

C "AIEAGGAAAA C.-EA rTOA.TGT 

iaggaatitg c -eai . .at a 

T ZZhGkC ITT C 3GACG GATG 
TACGTACAAT CAA.AGA A ACG 
A AAATGGGGTC GPTAATA. IAC 
AATCAATC-AA TATIIAAATC 
CTAGGAACGT (G'AGY.OAG 
TIAAATIATT A AT 3 I GO ATT 
T3ATAAATAT AAAAIAGAAG 
C AIAACCA3AA GATTTOATTA 
AAGTGTIAAA GAI IGOGTGG 
TIAATTTGAT ATTIGAOCTG 
AATTAACGGA CAAAIIATTT 
I IAETAGGAA GO: V*. AG AAA 



CGGTGAAAGT 


GAA. 


■.GACGAC 


CTTTCCCAGT 




< :i:GGGT 


GTGGTAGTTT 


T' : ' " 


ATAAT 


GCTTATCTGA 


TGA. 


AACCGC 


TOO IOGATGG 


AAA 


GAIAAAA 


CGCAATTGAC 




: IT AAA 


GTG 3TACGC Z 


A A. 


GICCCA 


GAGATTTAAO 


OGG • 


I 3.AAAA 


GTTTAGATGG 


TO'-. 


OEGCAT 


caagacatcg 


TGI" 


G AAGTC 


AAGTTTTAGC 


G". . . 


GA ACCAA 


GGGAAAGTGT 


T A A. 


at::ttta 


TG C 1 - TC C TTT 


TIG' 


\AAACAA 


CAAGCCAATC 


TAT' 


"GATGAT 


CGC.AATCGAC 


A I A 


AG IAACA 


CGACCGCTAG 


G ■ * 


. ATGCT 


cagiaactae 


T : I 


GEAAGGT 


ATAC AGCGAT 


TIA 





EF126-4 {SEQ ID NO: 472} 



ee avkagdtegm tntaikvakde g 

ladckrileg qatfpvca 3e oee"eew\ae 

mlasyrggkq fmfpdgktf i esaeetkig/e 

lklaldtynq thgdltnfkt ieel-ategva 

veysndyqga aaevlalnqe i inqgyfi1 j n 

lqqgsstped fitsqsiddf itgeeci^td 

ndvpvqingq tisatstegy vgkjtihyev 



LA: GAG; GNF PHV< RQAIDEV VAGI .ACQ DRV 
VNTQEEE'DKS QFIA-GFGDAAR T A ' GTAPG 
NOEL I .E.AEHK TNTEOSINEY PETE EPEQVS 
A YE IE E : EES SVNAI YFDKYK T A ' A EVKQE 
FEZ ;i ATA SLT I ANQFDI QEAOETDDAG 
KEMP A 1 1 



EF127-1 (SEQ ID IK : 473 : 



TAGCGAAAGA AAATAGGGAG L AOT/ JIAATG 
ATGGTGATTG TCGCTGGAAG / GTTGTGGGA 
GAAGCGGTTA AAGCAGGAGA TAGAGAAGGA 
AGTCTGGCTG ATTGTAAACG GATATTGGAA 
GAAACGGAAC CAGTCGATTT AGTAG-TTGTT 
TTTCCACATG TAAGAGAAG I G-ATTGATGAA 
GTGATGCTGG CTTCAOATC I ' A.A A ' -GAAAA 
ATTAATTCAG CTGATTATGA OAOGIEvTITG 



TOOAAGEAAG CA.AGGAAATT A AG .AOO'GACA 
AATTT AAGTC CCAGATTGGC 01 1 A I TGAA 
ATGACGAATA CGGTGAAAGT GA./ AGAGGAC 
GGACAjIGCTA CTTTCCCAGT TIAGAG IGGGT 
G A.*-. G A I GGTA G T I G TAG TTT TI 'I / ATAAT 
GTGGTIO'AAG GCTTATCTGA H • . AGCGC 
CA.',TTTATGT TTGGTGATGG AAA .•• IAAAA 
C Z G GT : ■ AATA C AG A ATTGAC TEA '.: ". ■ AT AAA 
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TABLE 1. Nuciccr.Kk a:o: A 










. 7ov Geo,- 






A G 7 C AA. T T T G 


_ ^ r ... „ ... 






-"■ 


GTO 


- . .fi ' - 1 _ 




G.O0CA 


'--3 jr. ^ 1 j/uv.". ^ 


, . , 




~ ■-- - 


07 G 


000 : 


- . ^ . TA„-,0 


7.7 


7G«<AA 


r\ — 5ihll J 




7.77. ; 7 7 77 7 


7 7" 7 


707 


070 


TAGATOO 


: 


TGCAT 






.7.7": . 7 ; 77, 


777. 


077 


770, 


0A7AT7 7 


: 


: .7AGTC 


"^""AGT^GA - irr 




: '07 ; ~.-"-o; 77' 


77.7 : 


7 AG 




7TTTA.G 0 


7 


-077AA 




- - - - - 






0TT 


GO 7 




: 


. . 7 TTTA 


AGTTCAG^GA 




7 0 . 7-' . 07' 


07- -. 


0_A G 


to;- 


■0 7 7TT7 




. OACAA 


GAGTTG'^A ^ r 




'77 70 77,77-0, 




OTA 


770, 


0 7 7AAT7 


0. 


: ATGAT 


TTTACAAC : "" r 




- ^ - 


77" 


70G 


7 07 


OA T 7G7:0 


AO 


07AACA 


GOTTCATTA ^ 




7 7007 0~7777 




7T7 


70- A 


7 7 77TA7 


G . - 


ATGCT 


GGAAATGA'"" 1 ^ 


-L o- -l - .. 


70070.070;G70, 




OTT 


7A0 


7 AAG TAG 


7 0 


.0 AAGGT 




/-iC^OTC A7GA - 




7777,,' 


-"„«A 


?l - 
/-i ^ , 




7 


7 OAGCA 


A ^ ^ C T 1 ^ j T AA 




O07'7000 770, 


0 77,0. . 


. 7TTG 


7TA 


07 7AA77T 


T7 


7- .OAGCG 


A 7GATT77TA 




7 7 7 7.. A7"777 


T,"0 0-- 


- i OA 


C ( 77." 


AGAAOA 




" 0 07 ATT 


a Ti r. Ti £ ~^ *i t r 




.7 7.-70 7 '7 7 0 


007 77 


. . 77 A 


AO 7 


" : TGA.^ OA 


7" 


0 .GAT 


1 ' JT J .rt ± \J 1 _ M 


A-t^-vO A 7 : . ;07T 


770 0'-" 7 7-7.70 




7 7TT 


GG-A 


;7 7AA0C 


7," 


0 TGGTG 


i bALA 1 1 .-i 


A. ' F AGiA. 0 r 00 7 T 


.7 777 ; 07000 


0707-" 


j 




O7 3A7 0A 


r , 


7 " 0 T AAA 


UrA Ibl 1 AC Aj 


C rAAGGO. 7. 7 




07.7,77-' 


. ~. _ m. 


A.-.G 


0.AA7TTT 


T - 


■ " TGAAC 


A A APA A O * 1 ^ 

AAAlMbLrtb 


A7AGGT7G7' 77 


0 07777 000,70 


07" 7 


-' 70T 


at;. 


OYOTG-7,7 


T, 7 


7 0 7ACT 


AAAATTAAAA 


CTGA0G7A0 2 


'•- : - " ' 77 700 


77'7 


7 7^77" 


A 77, 


7T3A„h 7 A 




07 7 ATT 


C C AA C C AA.' j 


- . . 


7007' - 70.77 




.■ \ - 'j 


TO-T 


7A7ATT7 




-YOCCA 


AC'_G .1 AACA-_ 


CA2CG 7 I A7 : T 


7 707 00..7 7 


A77 




A77 


; ag AA 7 ;; 


A. 


OAOCAT 


TTAGATTTAA 


C 7AAC "7- 70- A 




7077. 


_77 7 


T7A 


7AA7A 7 7 


T7 


' 0 07TAAC 


G AAAC AAG'_ A 


CTTGGA7 7 "A 


7 7 7.7. 77 AO 7 




.-a 7 A 


TTA 


7 ?kAA G7 




. .. ATC 


AL IbA lb? 1 AA 


AA.GTCA. A 7 A 


07A77O7OGG 7 


A70. 77. 


" . OTA 


CAG 


7 T AAG 7 




7AACA 


A A 7\ A A A rp 7\ 

LAAbAAAA I A 


ACAAAGTAA 7 


07"I'7O. ' 0^0777 7 




. 7^,7 


c:cg 


\7AGCTA 


T 7. 


7 7.0TTTA 


AGIGGTCATA 


C G T A 7 j". T 


7:0 7 70007,7 7 


A 77'."-. 7 


7 T7A 


A7G 


7 7 AGO 0 7 


A 7 


0A7GAA 


bAAi 1AGCA~ 


C'7 ^ ; ' 7'.-/^ 


O77O,:7 7I077 7 


A7077 


•7 77 


AAG 


7 7GAG7T 


'7 ; 


"7TTGGC 


AACGAAGGTG 


A7GTG7TG 7 A 


77777-7. 77,0 


7 777. 


7AA 


CA . ' 


■A7CT0 7 


A 


77 7 7CCA 


GAAGArCCAA 


CGATTA7 AA.A 


0707 ,70707^0 


GO ; 77.. 


7 0^7 


ATT 


TAGATTT 


AO 


" 7,7CGT 


f'A^f'^'AA^-'A » — 

bALLAAbA.". ^ 




7 7 7, • 


0 77 ~r 


0 7A 


AO G 




'7; 


OGGACC 


LAAGCCAG'_A 


TGGTG 3ATGA 


0770;0,7 70,0 


07 , 7 : 7 


: 0 7A 


TOO 


7 AG AG 7T 




-7 ,-TTNCT 


GATslGAAAATb* 


GCAA^ 77*77' 7 7 


OO0O.70O7^7T 


Gl7_777 . 


TAA 


CAG 


r ,7iGAA_0-0 


TOO. 


./'-.AGTA 


ACTTTTACTA 


TG.AACA-„A,A.A 


7,0777 7 AO AGO 


77.; : 


7'01'T 


TAG 


7T7GT7"A 




777 AC A 


ATGACTATTA 


CCACTA.^A,*,7 








A07 


"-. \ TTA 7-0 


C- ' 


7ATATT 


GAACAAGGCG 


GGATT7 7CAA 


1 7 '7 7. 7\ « 7" C G A ' 7 




". '7TG 


G " ' 


* "GAA , - ; " 




77TGTTG 


CATTCCAACA 


AGCCAA7C 71 




G70 7 7. 


■ ■■ — _ 


CAG 


-A 3 AC C 7 




.7-..0ACCT 


GAACCTAAAC 


AACCGCTAAA 








CT7, 


7.\AATCA. 


T 0 .- 


. 7 770 AC C A 


ACGAACCCAG 


TCAATTTT73 






• -AA 


TT7 


0TTTA7 7 




7 OOCTAAT 


ACAACAGTAA 










TOO 


7TATA7T 


r::: 


' 70.TTAGC 


TTTGGCATAA 


CAAAAA.ATAA 
















EF127-2 (SEC in 100:774 
















MF KKATKLLSTM VI7A7-T7V 


07 0.7 POlvOGAEO 77" 


. •••GTEG>: 


7.it*ti:y>: 


GO . 




LADCKRILEG 


QATFPV 7'A70 






. ONF 


PHY 


PLAICE Y 




: OQ DRV 


MLASYRGGKQ 


fmfpdgkti;: 




\0 . 7 


. 7GKS 


QFY 


7' GFGDVP. 




: 7PTAPG 


LKLALDTYNQ 


THGDLTMRKT 






7 7HK 


TNT 


Y3SIIJEY 


FO " 


7-7 PLQVS 


VEYSNDYQGA 


aaevlal:\7e 




A YW 7; 7 


O SLS 


SYI7 


0'7FDP;Y0 


TO'- 


' 7PF-/KQE 


LQQGSSTPED 


FITS^SIOCF 


7O770 77 7/- VKLj 


K77. 7 7 


7 7 OTA 


SOT 


I ANQFDI 




7 "A TDD AG 


NDVPVQINGQ 


TISATSTEGY 


vgn:o7hyev 




: . OAT 


LVS 


0GTM1IQG 




7 OF PEAT 


IPKNDNAHAC 


DVTPEDPTIT 


?"E 7E:77EHLD 




OF DW 


HYF. 


TAFGNET 




'7 , 7ASMVD 


DINKVLDIID 


VKVTDEMGKD 


7'TOI-]7'7V7'QE 




0:0' K 


QAD 


OYDYLSO 




"7o;t:ttk 


IKTDATDEEL 


APYIE7G77P 




GO""" ■ " 


' 'KPT 


VTP 


PPVDPMI 




OGQEHL 
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TABLE 1. Nucleotide and AnxM.: 


/ cqeneix • if £. faccalis Genes 






DLTNRDQEFK WNVKTAFGUE T: 'TV." 


xxxm mtt d\xx-tdengx 


dv; ■ 


XGKVTQ 


ENNKVTFEMN XQADSYDVLM Ci::V 


xmtitt k tee laxyieqgg; 


pit// 


MXNFGN 


EGDVLHSNKP TVTPPAPTPE XXTX 


• : , xxm. . xrd qekxwnvkt/. 


FG I ." . 


XMTWTQ 


ASMVDDINKV LDITDVK7XX EiJGK' 


x.xtxt 'x/m , x . x:vt ftxnkkddst 




XHTYTM 


TITTKIKTDA TDEELAP7 IX CGGI 


x . ; : x nxgxx' :x t lh snkptvtppa 


pt?: 


MPKKPE 


PKQPLKPKKP LTPTNHQAPT NMTX 


am / m kxi mi m xtnt tvuplymiag 




TMAISF 



GITKNKKRKN 



EF127-3 (SEQ ID NO 475'; 



GAATCAA GGAACAATTG CTAA / 
ACGATTCCTA AAAATGATAA T : 
ACAAAAGATA TCGAAAATCA A3 
TGGCATGTCA AAACAGCCTT 'J' : 
GATGACATTA ATAAAGTG IT A; 
GATGTTACAG CTAACGGTAT A 
AAACAAGCAG ACAGCTAT jA C*I 
AAAATTAAAA CTGACGCAAC G . 
CCCAACCAAG CCGACTTAAA C! 
ACCGTAACAC CACCGC I A XT T: 
TTAGATTTAA CCAACCC-CGA T. 
GAAACAAGCA CTTGGACTCA A: 



'i xmcax; 

\ / L- AA A IX 

v ATT G 

' •' : AA G 

XAAGT G 

.AGAA TI7 

A AAT GA; 

XXTAT AT3 



x:x-tg ct; 



at mt 

jTCA 



-tga cgxmagaag.m tg.v macgatt 

MA ATXGTGAAGA TA7XXTCGAT 

\CTT GGAMCCAAGM CAM ' ATGGTG 

\AAG TCACCGACGA AAMX -XT AAA 

-.ACA AA ; ':TAACTT' M TGA MMTGAAC 

\cgt atacaatgat tat t.ac c act 

:ctt aoattgaaca. agg-'-gggatt 

■ACG TGXTACATTI caa;:aaacca 

xaag Ac .itagaagg ac;xxaacat 

MAXG TCX/JVACAG T TT'T MGTAAC 
" AC A TTAAT 



EF127-4 ( SEQ ID NO: 476) 



NQG TIAKEFPEAT 



IPKNDNAHAC 


DVTPEDFTIT 


?'ix: xi 


:q: 


IHXD 


X'HXXM 


TDW 


HVK 


IAFGNET 


ST.: 


MM ASMVD 


DINKVLDIID 


VKVTDEKGKD 




,r-p- 


7 TGE 


knkyt; 7 


■MNK 


QAD 


>YDYLSG 


HTM 


XTITTK 


IKTDATDEEL 


APYIEQGGIP 








GDVLHM 


7XPT 


VT? 


>PVD?NI 


AMT T 


.'EGQEHL 


DLTNRDQEFK 


WNVKTAFGNE 








L DIM 












EF128-1 (SEQ ID I\!C ; 7 7 




















TAG C GAAAG A 


AAATAGGGAG 










M.AAG 


CAM. 


VG AAATT 


AT- 


^GACA 


ATGGTGATTG 


TCGCTGGAAC 








t-.-^n - . . 


",GTC 


CCM 


;attgg": 


^T^^p ^ "■ 


. XTGAA 


GAAGCGGTTA 


AAGCAGGAGA 










".ATA 


CCX 


TGAAAGT 


GA.A 


.GACGAC 


AGTCTGGCTG 


ATTGTAAACG 






- 7 A A 


GGAX AA 


^TA 


CTT 


MCCCAGT 


TCM. 


MJGGGT 


GAAACGGAAC 


CAGTCGATTT 


/Ml i 








MXTA 


GTGGTAGTTT 


TTM 


■'-.■GATAAT 


TTTCCACATG 


TAAGACAAG Z 






ix;aa 


GTGCXT 


/ AAG 


GCT 


tatctga 


TCA 


AGACCGC 


GTGATGCTGG 


CTTCATATC J> 






■XAAA 


XAAMX X 


■'■'.TGT 


TTX 


MTGATG 5 


AAiM, 


■ .:aaaa 


ATTAATTCAG 


CTGATTAT jA 






cgcgt;; 


-AATA 


CGC 


AATTGAC 


TTM. 


: MAT AAA 


AGCCAATTTG 


TCTCTGGTTT 


TXGA 


GA 


IGTT 


cggacx; 


:atg 


GTG 


GTACGC I 


AXvC 


A.'XCCCA 


GGATTGAAAC 


tcgctttaga 






::aat 




: ACG 


GAG 


ATTTAAC 


GAM. 


TTGAAAA 


ACGTATTTCC 


TATTAGTGAC 


AMAT 








ACAC 


GTTTAGATGT 


TTA 


MTTG CAT 


AAGACCAATA 


CCAATGATTC 


AXTXM 




TGAA 


TATT ; X 


ATC 


CAA 


GACATC I 


TC '1 


':'X AAGTC 


TCAGTGGAAT 


ATAGTAATGA 


T X 


C A 


XTGT 




XX AG 


AAG 


TTTTAG I 


GTT 


M A AC CAA 


G AAATT AC T A 


ACCAAGGCTA 






X ATT 




MATT 


GG3 


^vAAGTGT 


TGM 


/: MTTTA 


AGTTCAGTGA 


ATTCATACTT 


TGA'I 




^ J' AT 




AAG 


TGM 


GTCCTTT 


TG r : 


.'. MACAA 


GAGTTGCAAC 


AAGGGTCTAG 










' TTA 


CAA 


gccaat: 


TA" 


M \TGAT 


TTTACAACCC 


AATTAAAACA 


aat: 




C AAA 




TGG 


cc: 


MXvTCGA I 


AC-: 


■ ;aaca 


GCTTCATTAA 


CGATTGCCAA 






t - ^ ti 


ATTTA." 


'MITG 


cgaccgcta: 


GGM 


' MATGCT 


GGAAATGATG 


TGCCTGTTCA 






TTA 




ATTT 


CAT 


CAACTAG 


TAX 


"• M'-.AGGT 



1 
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TABLE 1. Nucleotide antl Air.::u- 


Acid Scqci.iL-: 






face 


:Iis Genes 






1 ALb 1 AL^GAA 


ALA 1 -_Av_GA x 


J. \ 1 , 


'.CGAa 


G r !G" 


GGG 


-AAA 


ATA 


:agcgat 


TGA.' 


GGAGCA 




Gj i AG i 'j'j- A - 


AjGTG 




GG a. 




■TTG 


CTA 


AGGAATT 


TC'G 


G.AAGCG 


ACbAl ICC 1 A 


TV 7\ TV 7\ 'TV 1 ft TV ft 

AAAA I ( -rA _. AA 


Ti-- l ._ Ij7 


lT . v.? 


1 L.. J. , 




TGA 


CGG 


TAGAAGA 


T 1 -- . .■ 


■ACG ATT 


AC AAAAGA I A 


1 CGAAAa I _.h 


A<- - . C 




GAT' 


GTC- 


TGA 


ATC 


^TGAAGA 


TAG' 


GTCGAT 


TGGCATGTCA 


aaaca _ T ^ 


TGY 








TTT 


GGA 


GCCAAGG 


CAG- 


ATGGTG 


GATGACATTA 


Tv rn 7v 7\ f\ ,~i"T.~> T' 

ATAAAG-T-^-- I 


AG ..■ 


T TG ATT 


Ga 


;■ a 


■. .a AG 


TCA 


TCGACGA 




GGTAAA 


GATGTTACAG 


CTAACGG7AC 




-. . A_.-u-\ 


GAA 




■.AC A 


A.^Z 


TAACTTT 


TG.A: 


ATGAAC 


AAACAAGCAG 


AC AGO T A EGA 




■ - -~- * : 


g G'.r 


TAT, 


GJGT 


ATA 


CAATGAC 


TAG 


ATCACT 


AAAATTAAAA 


CTGACG 7AAC 


GGA Z 


^- .rUA j: .' r-i 


1TA 


gcg 


TT 


A C A TTG AA C A 


AG ' 


' '• J ATT 


CCCAACCAAG 


CCGAC TT AAA 


- - - - 


■■*" - 






PiGG 


TGTTACATTC 


CA : - 


■A AACCA 


ACCGTAACAC 


caccgc gatt 




' ' c A A \T 


ATT 




A AG 


ACG 


TAGAAGG 


ATC-.. 


...GAACAT 


TTAGATTTAA 


CCAACCGTGA 


TTA A 


-gaA. . 


A AG' 


tgg 


.AGG 


TGAAAACAGG 


TT ■ '■ 


GG^TAAC 


GAAACAAGCA 


CTTGGA CTCA 




/.GC ATG 


'- 


J AT 


.■■AC A 


TTA 


GGT AAA GT 


GTT. 


G AC ATC 


ACTGATGTAA 


AAGTCACAGA 


TG.-iA 


gat ttt 


v-. a a 


-'AT' 


-TTA 


CAG 


GTAACGG 


GA:T. 


.GGAACA 


CAAGAAAATA 


ACAAAGTAAC 




GAAATG 


A_\C 


GAIT 


.AAG 


CNG 


ACAGCTA 




TATTTA 


AGTGGTCATA 


CGTACAC AAT 


L-./\C'.. 


A.TTA :T 


AGT 


:,AA 


TGA 


AAT 


GTAGCGG 


aa: 


' GACGAA 


GAATTAGCAC 


CTTATATTGA 


A ^. „ \ . i 


TGTT GT 


A T^ ' 


"' T G . 


,AGC 


AAG 


1'CGACTT 


GAG ■ 


'TTTGGC 


AACGAAGGTG 


ACGTGTTGC A 


TTC C 


AAC. AAA 






■ "AA 


ca: 


ZACCTG T 


A - - . 


C, TGCCA 


GAAGATCCAA 


CGATTAC AAA 








T A A 


:.GAC 


ATT 


TAGATTT 


AGG • 


GAACCGT 


GACCAAGAAT 


TTAAATGGAA 




AA.YACA 


GGT 


:tg' 


'GTA 


ACG 


AAACAAG 


C A G . 


TGGACC 


CAAGCCAGCA 


TGGTGGATTA 






G'G ' 




g:a 


TCV 


TAGACGT 


GAG 


". G TTNCT 


GANGAAAATG 


gcaaagattt 










'TAA 


CAG 


A AG AAA A 


TA ■ , 


GAAGTA 


ACTTTTACTA 


TGAACAAAAA 






TAG 




GACT 


TAG 


CTGGTCA 


TA G 


. GACACA 


ATGACTATTA 


CCACTA.AA.AT 


TAG A 




GC A 




' ATG 


AA Z 


aattag: 




: GATATT 


GAACAAGGCG 


GGATTCCC AA 






TTA. 




TTTG 


gc; 


ACGAAGG 


T j A 


GTGTTG 


CATTCCAACA 


AGCCAACCGT 






gga 




JGC 


CAG 


/■ AGACCC 


AA."-. 


■■G/vACCT 


GAACCTAAAC 


AAC C G C T AAA 




aaA.AAA 


cgg 




".TGC 


ct; 


T AAATCA 


t : 


GGACCA 


ACGAACCCAG 


TCAATTTTGG 


A A. A A 




A TT 




■GAA 


tt: 


ATTTACG 


AA":'' 


.'ACTAAT 


ACAACAGTAA 


ATCCACTTTA 


T ATG 


ATC j ZA 


CGT 




'■.TTG 


tc: 


T TAT AGT 




; ATTAGC 


TTTGGCATAA 


CAAAAAATAA 


AAAA 


AGAAAA 


AAT 















EF128-2 (SEQ IE KC : 4"8; 



MF KKATKLLSTM VIVAGT.^V 




PTLALAEE ACGC 


GCTEGM T MTVKVKDES 




LADCKRILEG 


QATFPVC-AGE 




CG ; V\'E 


I AG. GG . 


:;t;f 


PH'vFQAIDEY 


VC TT 


..GGT DRV 


MLASYRGGKQ 


FMFPDGKTK I 


ICS AD'; 


CCT'GR 




c ?:s 


QFYGGFGDVK 


rpy ■ ■ ■ 


CTTAPG 


LKLALDTYNQ 


THGDLTiGTKT 


Y r _.Lj '■. 


TT T VA 




G.HK 


TNTITDSINEV 


F E • ' i 


YFLQVS 


VEYSNDYQGA 


AAEVLALt;;.E 


T T G 


YC. YIN 


AC". : * 


:-LS 


SVITGYFDKY}; 




GFWQE 


LQQGSSTPED 


FITSQSZDCF 


TT Li- 


: YKG 


F.GA^ CY" 


/ TA 


SLT ] ANQFE 1 


CT T -.C 


• C'TDAG 


NDVPVQINGQ 


TISATSTECY 






fkiggy: 


.;jiT 


LV5Y r 'GTMNC C- 


tia; 


TFPEAT 


IPKNDNAHAC 


DVTPEDPTIT 


ft: ec 


C'EFLiD 


lYC!!: :G 


." LW 


HVF'J 'AFGNET 


GTV.G 


CASMVD 


DINKVLDIID 


VKVTDEIIGKD 


\ ; TG.NG 


TCTQE 


ITH'C'Tr 


GGJK 


QAITYDYLSG 


FTYT 


'■'TITTK 


IKTDATDEEL 


APYIEQGGIP 








C- PT 


vti-T'Pvdpn: 


AFLT 


"EGQEHL 


DLTNRDQEFK 


WNVF.TAFGIIE 




C' AG ITT 


dd : 


._ CTT 


DVTYTDENGK 


evt; 


TYGKVTQ 


ENNKVTFEMN 


XQADSYI YLS 


Gli'CYT 


:CT I TT 




. ..EE 


LAPYIEQGGI 


r MC : 


. T LNFGN 


EGDVLHSNKP 


TVTPPAPTPE 






QGiY.YT 


"TRD 


QEFKWNVKTA 


P'GIY 


TSTWTQ 


ASMVDDINKV 


LDITDVT\-XX 




■\ 7 TE:NG 






FTi-uJKKDDSY 


GYFG 


giity™ 


TITTKIKTDA 


TDEELATVIE 


QG G J PNQADL 




,'LH 


SrTKPTVTPPA 




TI KKPE 


PKQPLKPKKP 


LTPTNHC-APT 


IJPCTT 


GKSA5 




:tmt 


TTGiT ? L YM I A G 


L I Y '. 


CYAISF 



GITKNKKRKN 

EF128-3 (SEQ ID NO. 479 



WO 98/50554 



IS98/08959 



TABLE 1. Nucleoli.' and 



AG A TGAAAj-iTGvr 
CAA3AAAATA AC 
AGTGGTCATa CC 
GAATTAGCAC CT 
AACGAAGGTG AC 
GAAGATCCAA CG 
GACCAAGAAT 
CAAGCCAGCA 
GANGAAAATG 
ACTTTTACTA 
ATGACTATTA 
GAACAAGGCG 
CATTCCAACA 
GAACCTAAAC 
ACGAACCCAG 



CCTGA 



T'l AAATGGAA 

tcgtgcatga 
g:aaagatct 

ccactaaaat 

G'-.-ATTC CCAA 

agcca,..ccgt 
aaccg; taaa 

TGG 



EF128-4 ( SEQ ID 



DENGK DVTANGKVT^ 
ENNKVTFEMN XvADCVCYLC 
EGDVLHSNKP TVT? ?A?TPD 
ASMVDDINKV LLTTC'CKVXX 
TITTKIKTDA TDEELAPYIE 
PKQPLKPKKP LTPTCC1QAPT 



ACCAAACAC 
TCACAGACC 

T Z, "m."<.-^"" — 

AAGAATTAG 
G CAACGAA J 
C.A -AAGAC 

ctacaaat : 



\TCTA 

tgaa 

ACGGC 

:;;:ca 

TCGT 
"-GACC 
CTNCT 
AAGTA 
\CACA 
\CATT 

cgttg 

AACCT 
AACCA 



IE LACCIE^G- 

t cectwwkc 
t fticckkddc 
t stttvtP: 



ATGN 
-" TWTQ 

-tytm 

KKPE 



EF129-1 (SEQ ID :; ) :4£± 



TGACAAGTGA 


AG AAAC 


CTCT 






: . . cat 




TCATTAAC c: 


" ': CAGCA 


ATTGCTACCC 


CAAGCA 


CCGC 






-::.TA 






. A.AAAAT 


CAAGAAATTT 


CATCAT 


TAAA 






• . AG 




CCACAACT AT 


TTCTTTA 


GAAGCAGAAG 


TATCTT 


:agt 


r~. - + - 




vc ttt 


TA 


:gtgaaca aa 


•'.-AACA 


CTAAAAGCAA 


AATCAG 


AACA 






"A " - AAA 




CTG AATT A A: 


CACTGAA 


AAAGGTAACG 


AAGCAA 


TCAA 










cttaatc-g ,,c 


. taca 


ACAATGCTAG 


atgca:- 










AT 


ACAATCAG CT 


' TTTCAA 


GCTGTTTCAA 


CAATCG 


TAAG 








AA 


CAAC AAA A AC 


. .CCAAA 


CAAGCCGTTG 


TT^ATAAAAA 






' . . '-.AG 


TG 


AAACAA AC TC 


C-iACA 


GAAGCTGAAT 


TAGAAA 


CAAA 




a/-tat t:a 


t . • CTA 


AA 


CAATCTGA AT 


T-AAACGTA 


ATGAAAGCTT 


CAT TAG 


:att 






A A AAA 


GT 


tctaaact cc 


: CTAGAA 


AAACAAAAAG 


CAACTG 


:tga 




_ : AG C AA G C 


V V " A TAG 


CT 


GCTGAACA AA 


AATCTGCA 


GCTGAAAAAG 


ccaaa: 


aag: 




'.A; A AAA CCA 


"AC A AG 


CT 


C AAG TGAA AG 


■•CAAGCA 


CCAGTTGCCT 


CTCCAT 










CA 


AGCTCAAG CT 


: CCTAACT 


GAATCAAGCA 


cg:aj\c 


AAA: 






'. ■ ■C CAA 


GT 


ACAGATAA TC 


•AAAC A 


GAAAATACTG 


GCTCTT 


CTTC 






- ' C AAC 


CT 


acaacacc AA 


. CAATAAT 


GGAAATAATG 


gtgcct 


AAAC 






t 


CA 


ACACCACA AC 


CACACCA 


GCGCCTTCTG 


CT3ATC 


CAAC 








TA 


CGTCAACC AT 


' . CTTTA 


CGTCCAGTAG 


TATGGG 


ATG C 






. -AA 


CT 


CCTCGTGC AC 


. AAGTT 


GAAGCAGGTG 


GCACTC 


CAAA 






. - ' - GAG 


AT 


AAAGTTAT CT 


ATTATG 


TGGGCGCCAG 


GTAACI 


CAGT 








AA 


AC AAAC AT GC 


AC CAGCT 


TCAGGAAGCG 


GTCACC 


GTGA 






.■ . _ : CA 




ACGCGT AC CC 


'CTTGGT 


TACTCAGGTA 


GCACAA 


TCGT 















EF129-2 (SEQ ID NO:482 



WO 98/50554 



VC'l GS98/08959 



TABLE 1. Nucleotide and Amino 


\cid Sc 


qaiCACA 


> of £". faecal is Gems 






VKKRLFASV LLCSLTLSAI AT 


"'A ~ 


7ADK VCKK ! V. 


eisslkakqg a 


la? :■ 


. SLE 


AEVSSVFDES MALREQKQTL E. 


: 


. :..ve 


ta a a 


. . . EK RNEAIKNQAF 




•ASTT 


MLDAVLDADS VADA1SEVQA 7 






la,. . , : 


: : awdkkaene 


«;'■;■ 


. AEATE 


AELETKRQDL LSKQSELNVH K. 


7 L : 




AAA . 


A EE QKAAAEAEQA 


RAA ■", 


, A.AAA 


EKAKQAAAKP AKAEVKAEAP V. 






A. . V - . . - ' . 


. ; 7E SSTQQTTET- 




AGATE 


NTGSSSSEQP VQPTTPA DNG M 


iggc 






" V' r .! 'PA PEADPTINAL 


ice , 


. A. LGLR 


PWWDAGLAA SATARAAQVE A 


;gii 


AOHWA 


ra:a" 


: agmw apgmsvimav 


7 : J e 


GTVTAS 


GSGHRDWEIN PGITRVGFGY S 




7GHCA. 











EF129-3 (SEQ ID IK):483 ) 



ggac aatgttgata aaaaaat7 1 aa a^aaaaaaat 

caagaaattt catcattaaa a ;:aaa\:aa ggggaa aa'AG cttcacaagt 

gaagcagaag tatcttcagt a tttg a tg aa agcatcactt tacgtgaaca 

CTAAAAGCAA AATCAGAACA ATTACA-Y:AA g/cataaaaa ACTTGAATCA 

AAACGTAACG AAGCAATCAA AAAT. I AAGC A CGTGAA A7TC AA3TTAATGC 

AC AATGC TAG ATG C AG TTTT A AA A- - A A AC T : A 7 A A ; AG AT 2 C AATCA C- 

GCTGTTTCAA CAATCGTAAG T AAA'AA JAAC G-7ATAA7GC AA 1 3 AAC AAAi" . 

CAAGCCGTTG TTGATAAAAA A "; ~AA AAAGG-.C GAGAAAAAAC TGAAACAACT 

GAAGCTGAAT TAG AAA C AAA ArAACAAAAT TAAA a: ATA AACAATCTGA 

ATGAAAGCTT CATTAGCATT A^AACAATCA TAAGA" A AAA GTTCTAAAGl 

aaacaaaaag cagctggtga ;aaaa-:;a g:aagaatag ctgctgaaca 

GCTGAAAAAG CCAAACAAGC T 3 ITGCAAAA C:AGCTAAAG CT jAAGTGAA 

CCAGTTGCCT CTTCATCAAC AA:AGAAAAA CAAGCAa:CAG CAAGCTCAAC 

GAATCAAGCA CGCAAC.AAAC AA I'TGAAACA A ."A- - ''.-7 AAA GTACAGATAA 

GAAAATACTG GCTCTTCTTC AT:AGAA:AA C:AGTAAAAC CTAAAACACG 

GGAAATAATG GTGGCCAAAC i ^TA7AGh AA7ATTACAC C AA C A C C A G A 

GCGCCTTCTG CTGATCCAAC AATCA. AT' jCA TTAAACATTC TACGTCAATC 

CGTCCAGTAG TATGGGATGC AG3T77E7:A G I77C73C AA CTGCTCGTGC 

GAAGCAGGTG GCATTCCAAA 7'; A"' 77. TAG T :'7"G7AGAG ATGAAGTTAT 

TGGGCGCCAG GTAACTCAGT /ATCAAAGAG T 3 AAA ; A. TG AAACAAACA7 

TCAGGAAGCG GTCACCGTGA 7TAGAAAATT AACAA A ACTA TTACGCGTG7 

TACTCAGGTA GCACAATCGT AGGAAACTCA GAA 



tga ■ 

ATT/ 
T • 



TAG 'A 
AAGA 
AT- 



AG C ■ 



GGT 
CG A 



AATTTA 
AAAACA 
YaTGAA 
ag:aca 

7TCAA 

aacaaa 

"-GAACA 
AACGTA 
I'TAGAA 
ACTGCA 

AAAGCA 
ACAACT 
A-CAACA 
AATAAT 
ACACCA 

; ATTTA 

agagtt 

■CAT ATG 
• : AGCT 
7TTGGT 



EF129-4 ( SEQ ID ND-.484) 



DN VDKKIEEKNQ EISSLKAKCG A I 


.A S;>\'L5SLE 










AEVSSVFDES 


MALREQKQTL 


KAKEr 


:;loqe iteeea 


\ j EK 


RNEAIKNQAE 




JAQSTT 


MLDAVLDADS 


VADAISRVQA 


gate 


g a. hag lm 


TKQ 


AWDKKAENE 


KE7A 


jLEATE 


AELETKRQDL 


LSKQSELNVM 




A A A A A AA.-:. A • 


A .EK 


QKAAAEAEQA 


elaa: 


a;.?;aaa 


EKAKQAAAKP 


AKAEVKAEAP 


w.sa: 


T A'EAQ AFAASA 




SSTQQTTET7 


tpaa: 


A A ATE 


NTGSSSSEQP 


VQPTTPSDNG 


IIKGCA 


■ r ' a gat v:a- a: 


A PA 


PSACPTINAA 


MY LA 


A LGLR 


PWWDAGLAA 


SATARAAQVE 


accg : 


a:aa;e raai ' 


: mw 


APGKSVIMAA 


i ! . i . . 


"TAS 


GSGHRDWEIN 


PGITRVGFGY 















EF130-1 (SEQ ID NO: 485 . 



TGATACATTA AAAGGAGGGA 
TTAATCAACA GTTTATTAGT TATA/ 
CAGATACGTA GTTGGGTGGT 7C A. : -. " 
CCAGCTGATG TGAAGAAAAA TATGA. 
TCCTTGAGCA CAGAAGCGGT 'AATiA T 
GGTGCCATTG CGATAGCAAG TGTC 1 
GTCGCTTTAT TAACTGGTGC :.*GGG/ 



:gcg: caaaaaaaga aaaaaagagg 
;7,ttt a7c//a73a ct tagcgttaat 
:,aaat aaaag'-^'agt acgccgttag 
::tcgt ga/gaa^aagt ttgactttga 

\.AGG 2 CAATT77AAA ACAAAAACTT 
AAA7T AAATA^a: AC A TTTTTAAAGA^ 
AC ATG AGlA.AA7-AAAT2 AAGTCATGGA 



;iAAAAATTGG 
T7TAAA.CAAT 
CA7'.'" : 77GAAA 
TI'C A 7TGAG 
ACC7 1'^'GATT 
AAA ' ' A AAAT 
GAA . .A AAAT 



WO 98/50554 



S98/08959 



TABLE 1. Nucleolus and A::;:: . aa ■ A CL.il is Gcr.cs. 



TATGCCTTGG CTAGAGATCG AAA A -AT 

A C CAAAAAAG a c g a a. c t C A T "I .-" ■. . - -. ^ --. g . 

acttctgtag aaaaaaagga a : a: 

aa ta tg att a c c t t aa ttac g : a . :; ;, t 

ggaacattag cagcaacgag gg a gaaaa 

C AATTGG AG C AAAAAA : TT A A : ' A A _ GG 
EFI30-2 [SEQ IE A :4c-" 

yikrrenmrp ke?aaagaa ::agl. gaag 
advkknmare ttfdfcg7eg : c a. va-'av 
alltgagtmk ed;aaa:aa" a: , a .. : ggg 
svekieptrv elagaagaga :aa:a:a;: 
leqktladot a 

ef130-3 (seq zd a. : 4 " ■ 

cgttag caagttgaaa 

ccagctgatg tgaagaaaaa tag; •• a - a;t 

tccttgagca cagaagcggt aa - aaaa 

ggtgccattg cgatacjaag aa AAATT 

GTCGCTTTAT TAACTGGTGC AA A AAA; 

TATGCCTTGG CTAGTCATCG AA: . --A. -.GAT 

ACCAAAAAAG ACGAACTCAT 77AGAACAGT 

ACTTCTGTAG AAAAAATCGA AG ' "' GG 

AATATGATTA CCTTAATTAC C7A7GACAAT 

GGAACATTAG CAGCAAGGAG G C7TATTAAA 

CAATTGGAGC AAAAAACTTT AG A A A GAAG 

EF13 0-4 ( SEQ ID AA 4 ?P 1 

VSKLKP 

ADVKKNMARE TTFDFGSVES ASAAGAAAQ 
ALLTGAGTMK EDQVTAATNV AAAGGA7G7G 
SVEKIEPTRV ELIDIAPGQN IAGG ACGAL 
LEQKTLADWV A 



A GA7G7TCA " 7 A A' AAAG A 

AA CTGTTTATAG A:/ GGAATA 

: AA TA GATGAG G G 7 A " " G GCAA 

AA GGAGGCGAAA A : GCAA 

" TG ^ A'ATATG — G • ' ATTC 

;:;a agaaac*. ag :• g a alkp 

'ig aia:?svf:a gg : asnv 

GAT KrAELIYAG EGA" AAYKIT 

'GG TEAATTP1AE AAAAGKAFQ 



■ AT TTGACTTTGA TA ' ■ AG AG 

■AAA A G AAAAA C T T .A G G AGATT 

:GA TTTTTAAAGG AT" ' A GCAAT 

,,AC AAGTCATGGG GAAAAAAAT 

A ACT TATTTTCACC TEA .A GAAAGA 

AAA CTGTTTATAG AG; AGAATA 

A AAA TTAATGACGA GAG A; AGCAA 

- 'AA CGACGCGAAT TG ' AGCAA 

. ...GG AGGATATGG'T A,. ' ATTC 



• a; aaapsveia eiagaglsnv 

: AAT EACELIYITE AAA "GTYKIT 
G AG TLAATTPIKG GAG . AGKAFQ 



EF131-1 (SEQ ID !A :4S9 ; 



TAGGCGGAGG 


TAAGCGGTAT 




_ ~_--^A. Art. . . . . 






AGACATG 






G A G AAT 


TGGCTTTTTA 


TAGTATGTTT 






■ ■ - '--.^ 


GT 


GGTTATT 




i 'u : . .- 


A GAACG 


TTCTTTTTCA 


ctagagatt: 






. C A 


AA 


STGGTCT 


T 


VJl / ■_.-' 


AAAG AT 


CGCCGAAGTG 


ATAATTATGC 










ECACCAC 






'AGAA 


CTTGATCAAA 


AAATT :AAGA 






. -. . G G 


CA 


ITGATCA 


T 




AATGAT 


CAGGTTTTAG 


T AAA T AAAG G 








AA 


•\AGCAAC 






GAACACG 


CCAAACACAA 


ggttt:agat 






.A AAT 


n-i— . 


ACCACAA 




prri 


ATCTTA 


AAAGCAATTG 


AAGAAAGTAA 








± — 


GCTACGT 


T 




'G AGCAA 


ATTCAAGGTG 


CTCAGAATAT 






A AG A 


AT 


ATGACAA. 






G^ AAAG 


TTATC AGCAA 


TG C C T AATAA 






■ ■. G AAA 




ATTCAAE 


T 




E : AACAA 


AATACCATTC 


AA G T 1 - AA T AA 








CA 


GT AAATT 






ATTTTA 


GCAGGAATGT 


TAG AG AAAAT 






.GAG 


AA 


ttattt;. 


A 




AAA TAT 


CACAAAACGG 


CTGGTTTAAA 








C G 


TTATTGA 






A AAT 


AATTCAACAA 


GTTA - ^-G/iAT^- 1 






" .'AA 




C AAGTGG 






• ACCCT 



WO 98/50554 I C I I S9 8/08959 



TABLE 1. Nucleotide and Am::. Acid Scqcuc: 



E. jaccalis Genes. 



GCAGCTAGTT TTGCCCATGA ATT"' 
TATTGGTACT TACATCAATT AA- 
TTATGGACGT CTTCT:AG:A AAA"' 
TTACGTTTAC ACGGCGTTGA ACT 
AAGACAGGGG TCATATTGCT A.- A" 
GGTTCGTTGT TCLATGATGT AACC: 



- a a;a ca 

:*aaaa c: 
-tact g- 



"GG ATATGACGAC AGAAAATTTG 
TT CCACCGCACT TTA AA A.AAAA 

ta tctatgttca tg; .": , ;,attat 

'■iGG TTTTATTTTC AAAA-TATATG 

:ag cgaaatacaa agaaataatt 

J-,T TTTAA 



EF131-2 ( SEQ ID NO: 4 90) 



MRKRH AKKRHGGVNVv LFIVCLLW7 
RSDNYANLTK EIVAFPSGEL DQrTV 
NTRFQIGSIQ KSFTTTLILK AIAAC 
SAMPNNIVTD EEIIQFVKON TIQVK 
KTAGLKNFGF YETLLEQPNN STS V ¥ WTEDN SYL 
WYLHQLTSGH LVSTALLQKL WTAT ;OAYH GG 1 
TGVILLTNCV NPAKYKELIG SLFHD7TNLT VX: 



gasgyl: 
atyi ca 
■;ltld t" 
agkyn Y;- 



FTRDSQVSQ ESKWLH?:^ 

;aoq vlvnkgygfa • .. vantp 
'vr-z qgaeditiso mj: ;t::;glkl 

"L LA GMLEKMYQF.T YCA LIAANLYH 

taa asfahefgtg nvltittgdly 
ayl rlhgveagqq ata cask dmk 



EF131-3 ( SEQ ID 110:491) 



ttt aataaaaacg 

ttctttttca ctagagattc acaagttagt 
cgccgaagtg ataattatgc gaatctaacg 
cttgatcaaa aaattgaaga aaa aa attat 
caggttttag taaataaagg atatggcttt 
ccaaacacaa ggttt :agat tggctcaatt 
aaagcaattg aagaaggtaa ac tt a. c att a 
attcaaggtg ctgag3atat tacg.attagc 
ttatcagcaa tgcctaataa tata attac a 



AATACCATTC AAGTCAATAA A C AG A A A A T A C 
GCAGGAATGT TAGAGAAAAT GTATAAACGT 
CACAAAACGG CTGGTTTAAA GAAT'iTTGGC 
AATTCAACAA GTTATAAATG G A C A. ■ '? .A AG AT 
GCAGCTAGTT TTGCCCATGA ATTTCGGACT 
TATTGGTACT TACATCAATT AACA~GTG;;A 
TTATGGACGT CTTCTCAGCA AAGC TCTTAT 
TTACGTTTAC ACGGCGTTGA AGCGAGTA AA 
AAGACAGGGG TCATATTGCT AACTAAC TGT 
GGTTCGTTGT TCCATGATGT AAA7AATTTA 



caaaaa.- avga aagtggtctt 
a/-.- .a • a ag ttgcaccaga 
at*; ' ; :gg ctttgatcat 
g c : " a; to : T 3 aaaagc aa c a 

(AAA. .A A LTCTT TTACCACAAC 
GAAAC .AAAC TCGCTACGCT 
GAT AT TToA ATATGACAAG 
GAA A.YC AAA TTATTCAATT 
A ATT A" AACC CAGTAAATTT 
A C C T A A C AAG AA TTATTT AA 
TTA A ATA AAA CCTTATTGGA 
7CATATA ACCAAGTGCT 
;GCATGG ATATGACGAC 
'A! ATT CCACCGCACT 
.A A TCTATGTTCA 
.... ..... -. G T^'fTATTTTG 

; , ; A : . AG C G A AATAC A A 



gg; 

TAG 

taa 

AGC 

ctt 
tt; 



t:t 
taa 

A'C- 
CTC 
ACC 
TT" 
TG7 



ACAAGAT 
AA-CGAA 
AAATGAT 
CAACACG 
CATCTTA 
TCCGCAA 
: : TAAAG 
C.TAACAA 
ACTTTTA 
AC TTT AT 
: ::CAAT 
AATTCCT 
TGATTTG 
A C AAAAA 
AAAATTAT 
ACATATG 
A" TAATT 



EF131-4 (SEQ IE NO: 4 92) 



liktf fftrdsqvsq EEKWLEFIT: 

RSDNYANLTK EIVAFDSGEL DQK J 0 FT AY I GT 

NTRFQIGSIQ KSFTTTLILK AILTAKLTLD TA 

SAMPNNIVTD EEI3QFVKQN TIAOTXYU Y A 

KTAGLKNFGF YETLLEQPNN ST. ". :• CLAN SV 

WYLHQLTSGH LVSTALLQKL WTLA, , ACYH GG 

TGVILLTNCV NPAKYKELIG SLA"; i A\TXLT VA 



TDQ VLVNKGYGFA IjFAAAAANTP 
TQI QGAEDITISD MLAHISGLKL 
"LLA GMLEKMYQFC YQAAAMNLYH 

cpa asfahefc'ig iaa a. agdly 
a:yl rlhgveagcq aiaca tkdmk 



EF132-1 (SEQ ID NO: 493) 

TAGTTTTCTAATCTCACC AAAACAA;AACCTTTAA.A.A • A ^GGAGAGATCGTTATGATAAGAAAATGGAAAGTAGTA 
GTGGGAAGTCTGGGAATGTTCATT A' AA-TTAA ' ' .•■G3GCATGTTCAAC.AAATA.G" , .AAGACAAAGATACAGTG 
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TABLE ]. Nuc!ec:ijc ; ; :id An;i 



G C TT C G AA C G AAAA. A r I ' ' : .■ 
AT TG AT TTA C AC A G 7 A T 
TCAAAAGCA.GATT _ A.- _ 
rrATG CGAAC AAA G A G G A 
3AGAAAGGGAAGGAAGA' 
TTAGCGGAAAAAG ATT A 
G A C AAAGAAG C T AA_AT 7 ' 
TATTTCTCGAAAGCGTA 1 
ATAAAACACTTA3TTGA 
A TG AAAA C A G T AT C A A A 
G ATGGTG AT AG TTA C T A 



TGCGATG 



EF132- 



494} 



^-mrka t kw;g sl a:: : . : a : a : a : a ■ : :-': tv, 

lpedvq:-:tskadl: » Y^r-vur.XTG".: .:. a. v :at::: 

YAKN I EK WLAEKG P AN K K FY K EN 1 : 7 A.UA 

EEEGTPDCIKKLYEAAR7TKYPS7A A / 'AA;r.A' 
GLSK . 



EF131 



(seq :g : 435 ; 



atgttcaacaaatagtaaagacaaa 

AGCAGATATTACTGAAAATAYAGC;- 

atatgaa c c t ttg c c tg a a 3atgtt 
tggaggaaatgcttggtttacaaaa 
tggcatagatgttatttactyaga:- 

C GGTATT ATTTAC i 2 C T AAAAATAT : 

tctagataagtatattgaaaagtt; 
aaaaatgattgtta ; aagtg aagg; 
aatcaacactgaa^aagaaggaac/ 

CTTATTCGTAGAAA jTAATGTGGA ■ 
GATTTTTACTGATT 2 AATTGCAG A- 
AATTGCTGAAGGCGTTTCGAAA 



;aaa;c;. 



: aaaa. 



:attg: 
:tttt; 



,tatagcaaaagataaa 
A G atgttg aaaaaa c t 
: TAC aaaattagtaaaa 
■-.cttagagggtcagagt 
aaatattgaaaaatgg 
aaaagttggattctcta 

: TG AAGGATG C TT t AAA 

-aggaacaccagatcaa 
tg tggacgatagaccg 
7tgcagaaaaaggacaa 
:gaaataa 



l'FAAEDGIZ 

-:;t::::kk:-:: 
:r:vsTir: 



:ihsivpigkdpheyep 
■gkedphawlnlengii 
■ 5kaynvpsayiweint 
:: gs yyammkwnldki ae 



:gaaaaattaaagg' 
-.cacagtatc a: ac 
^gatttgatt yttt. 

r AAA 2 A G G AAA. : . A 

;aaggaagat : :: eg 
'^AAAvGATCCT ~ ata 
-■.GC1 /v/iATCT .a 7 
-AmA.GCGTATAATC 

:tta7TTgaaaaat 

\ G T A T C AAAA. 1 : A T A 

:agttactat3:'GA 



" TTACTAATTCGATTTT 
GGGAAAGATCCCCACGA 
2 G TGTT AA C TTGG AmAC 
■ T ATTTTGC AGC AAGTGA 
: TG GTTAAATTTAG AAAA 
.A-AATTCTATAAAGAAAA 
: TGAATTCCGAATGATAA 
: TGTGCTTACATTTGGGA 
AGAACAAAAGTTCCCTC 
: ATTCCTATCTATTCAAC 
■'lAATGGAACC tggataa 



EF132-4 ( SEQ ID NO:496) 



C S TN S K DK D TV A S K E K A K VW TN E I 
GGN AWFTKL VKXAKX E E ! TKDY F AA 
LDKY I EKLDSLDK E AKSKFAS IP: : ; 
LFVESSVDDRPMKT7SKDTIIIFIY. 



: v? i gkdphey7plf7 
:?hav;anle:;g: iya? 
:-ot?:7ayiwei::teei- 
:'ami:kv:nldk: aega:- 



■ . KTSKADL I FYNGVNLXT 
EKWLAEKDPDNKKFYKEN 
: 3QIKHLVEKLRTTKVPS 
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TABLE 3. Conservative Ami:.- Acid Substitutions. 
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Aromatic 



Hydrophobic 



Polar 



Basic 



: Phenylalanine 
■ Tryptophan 
j Tyrosine 

Leucine 
j Isoleucine 
! Valine 

Glutamine 
Asparagine 

Arginine 

Lysine 

Histidine 



Acidic 



' Aspaitic Acid 
! Glutamic Acid 



Small 



\ Alanine 

i 

j Serine 

! Threonine 

i 

! Methionine 
j Glycine 
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Hpiu oe-Bearing Po: 



EF001-2 


fro. i a our /-. : about i oys- 1 52. from 
25- . fr m ab . ut L;.o- ; -- > to about Lys-363. f:v 
A- -4' S. 




... 


EF002-2 


frma ammt • m :. ut A>p-S3. from ab. 
2S5 






EF003-2 


in::;; a'-out Am-2b5 to about Glv-266. 






EF004-2 


— . , , , 

from a^out A<n-23 to aoout Am-26. from abo;:* 
fro::; a'-out To. 15- to .;bout A\sp-1 59. 






EF005-2 


from a '-out 1 - s-24v u ..bout ( : Iu-252. 






EF006-2 


fro:;; a -out i b \ - 23 tu Aoout A >p-2S 






EF008-2 


frooo about T :: - v m to about Glv-94, from about 
105. from about Giv-2m to about Thr-289. 






EF010-2 


from a -out hv-129 to about Asn-131. 






EF012-2 


fro::; about Amp-"" to oonit Asp-79, from abo;. 
from a' l ouit A^p-25^ to about 7hr-258, from ab 
4b,v 






EF013-2 


from a -out T o: - 50 to about Asp-32, from abou; 
from about Gin- W>4 to about Asn-166, from a K 
195 






EF014-2 


from about Sor-203 to about Asp-206, from abo 
3 1 6 






EF015-2 


from about Ib'o-bb to about Gb-'-69. 







EF016-2 



from about l.vs-236 to about Asn-239. 



-)8959 



-ut Tyr- 
> about 



t Gly- 



Ser-87, 



Asp- 



:t Lys-98, 
b out Asn- 



■iii Ala-75, 
about Gly- 



nit Gly- 



EF017-2 from about S - -90 to about Gly-93 ; from about 
l c >9. from about Lys-230 to about Asn-233, fro: 
Gh -43!. 



uu Lys- 
; to about 



EF018-2 



from aoout Lys-159 to about Tyr-161, from abo; 
km. from ab oo Asn-250 to about Arg-256. fro:: 
Gly-395, frou about Lo- s-4 1 6 :o about Tvr-4 1 sy 



i bout Ser- 
2 to about 
: -428 to 
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Table 4. Residues Comprising Anoaomo Epitpe-Bearing Po: r. 





about \ru-4. • 






EF019-2 


lioaa Aaout /a ja-^O'O io aDOUi . ei-zii, irom «.u<o 

Z V v : ■ 






EF020-2 


from about Lvs-57 to about Asn-62. 




. — . — 


EF021-2 


from about Sor-33 io about Gly-35, from about 
from about Asp- 139 to; about Lys- 3 41, from a A 

Z? ! < ■■ , 1 1 Om oO .Hit v _J i 1 1 - ^ J LU a (JUL 1 VI _ Z / / . 






EF023-2 


from a'i out Lvs-232 io about Asp-234, from ao. 
.m.J' . i: om a.D ; Uii l ni-^a.> io aooui /vrg-HJu, no; 
Thr-4S0. 




— ■ - ~ 


EF025-2 


irom about Ana- 183 lo about Asp- 185. 






brU2o-z 


Irom aD0Ul l ci-„^ io anoiu /\ s p-au, irom a inn., 
from about Gin- 107 to about Asn-110. 






EF027-2 


— —z — - 

trom about Gin-/2 to about Lys-74, from aboa 

z ^ 






brUzo-z 


It 0 01 a» DOUl / N ^]J- J n 0 LO auULll ^Jlil-lOo. 






brUzy-z 


r,. ,.L_.,f \ .. i 1 i \ t , 'iKoni 1 \/c_177 from 

11 O m JuOUL ;a S|> 1 in U> dUUUt l^y^-lZz., lium Hi-- 

1 -. . 1 "A 






EF031-2 


from about Glu-30 to about Gly-33. 






cr \J J*+-Z 


from -ihmit ( 1 1 1 _ ° S m 'ihmit l7iv-77 from aboi; 










EF36-2 


fivm about Gin- 177 to about Ser-179. 






FFfft7-9 
i_<r uj / ~z 


iVnm nhont Ser-?S in about Asn-30 from abora 
from about Gln-10" to about Asn-110. 






EF038-2 


from ;ibout Asn-77 to about 1 ys-79, from aboo 






EF040-2 


from about Ays- 1 67 to about Gly-172, from a; 
Asii-242. 
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Table 4. Residua: 



"HI ■ 



.onst* 



Er:: o?-Bearin2 Pa 



EF044-2 


::. . ?out / .Shi' ■ iiy-194, t'ro:r. : ' 
- 






EF045-2 


l: . .'-mil . ,. : \n:'. hi . froir. .. 
I - -". ::.)inab ;.: \o a'". nil Gly-264. '.: 

(V.:.-? ! S. 






EF046-2 


i 1 1 a.bout S .•!"-!>• a'-vui G --2?, from aboi." 
tV( -"i ".bout 1 /;Nnit A qi-7S. 






EF047-2 


in i :bout / :c about / <p-?0. from a 1, 

2" 






EF048-2 


lo b ..bout / r- - about .oas-141. from ;a 
Gi->:54. 






EF051-2 


lo an about A :.p-~3 to a.bout i;lv-76 






EF053-2 


fivm about S:r-~ c > to about Goa-82. 






EF055-2 


iroao about/, ;p-2b lo a.bout Giy-28. from abb: 
fro : i a bout A " ( a - " : ' o a b out G ' v- 74 . from aba;. 






EF056-2 


fivm about A: li- A to about G5/-74. from aboa 










EF058-2 


from about l.ys-12S' to aboul ' ily-133, from a*' 
57 A from aboul iAo-t^o to aA.uit Glv-591. 






EF065-2 


from aibout Sor-236 to about ': yr-239, from ah 
35 A from about Lys-4 15 to about Asn-418, fia 
A: • -•'••^8, from about Asn-489 to about Lys-4 (; : 
ab. an Asp-5 . A from a.bout G.u-639 to about i_ 






EF066-2 


fivm a.bout Sjr-236 to about '."yr-239, from ah 
3:2. from about Lys-4 15 to about Asn-418. fro 
A-o— 48, from about Asn-4S l , to about Lys-4^ 
about Asp-5'. S. from about G,u-639 to about '; 






EF067-2 


fr.avi about S_'r-23^ lo about ".'yr-239. from ah 
3:1. ;: om ah an i_ys-A 5 to a'aout Asn-418, In 
As:> -48, from about Asm4£a- lo about Lys-^h 
ahoo Asp-5 A iaaa about G u-639 to about : 



b8959 



at Asn- 



■ut Gly- 
1 about 



:r-47, 



:i A sn- 



out 



Asp-69, 
Gly-89. 

Aly-89. 



~'Ut Tyr- 



out Gly- 
lo about 

-5 16 to 



out Gly- 
to about 
a : 16 to 



nit Gly- 
to about 
516 to 
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Table 4. Residues . aauprisim: Aiuiacn;^ Ep\u pe-Bearing For:: 







EF073-2 


fro n about Mct-^s io about Ag-100, from abo 
1 1 A 






EF074-2 


; - : ; 

from about Saroo io about I \r-59, trom abom 

1 1 0 . 1 i l) DOUL J ■ . 0- v : u 1 Li 00 111 U - 1- J UU, IlOni dOO. 

232. 






EP076-2 


from about A -;n-o -S to about 1 vr-40, from abt o 
from about 1 vs-79 to about G v-81. 






EF077-2 


fro n about /\ : iz--r i i lo about ^ ? ly-4 1 3 . 






EF078-2 


fro w Lib out 1 :r-A J - to about ( ily-296, from a:- 
3<A . from about GluO-^ to arout oly-52o. 




_ 


EF080-2 


iVorn about (Au-lb4 to about Gly-166, from ab. 
2(A. lioiTi about Ly^--_ to aoout Uiy-z4J. 






EF081-2 


from about Asiw to about Sc -11, from about . 
fnou about 1 vs-1 1 2 to about Asn-1 14, from ab 
]»o from about Aru-lNl to about Gly-183. 






EF083-2 


from about Cm- A to about A a-40. 






EF084-2 


from about l.ys-140 io about Asp-1 42, from ab. 
}6(\ from aba in Arii-262 to about Gly-264 




_ 


EF085-2 


from about Asn-95 io about Asp-97, from aboi: 
1 14. from about Asp-25S to aaout Ser-260. fro: 
Scr-403. 






EF086-2 


from about I ro-i i- to about t uy-l 15, from ab; 
22*;. from about Asn-2% to aoout Gly-299, fro 
l.\ --3--8, from about A m-42, v to about Scr-432 






EF087-2 


irom about lo oA !2 to about ;'4y-115, from ab^ 
224. from about Asn-296 to anout Gly-299. fro 
Lvs-348, fro in about Asp-42X to about Ser-43'2 






EF088-2 


from about Pro-1 12 to about Gly-1 15, from aA- 
224. from about Asn-296 to .mout Gly-299, in. 
Lv --348, from about Asd-42: io about Scr-432 



uit Asp- 



bv-88, 
at Gly- 



Asn-53, 



-out Gin- 



Hit Tyr- 



: yr-80, 
about Asp- 



oout Arg- 



ot Asp- 
m about 



ut Ser- 
10 about 



bout Ser- 
A io about 



ait Scr- 
10 about 
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Table 4. Resiuu . . apnsr r r • .r b;a- m'-tkarini? \' 



EF090-2 


1:vr. nnit r-. : .o\ a. A: 






EF091-2 


!• -out < • - : m. •' - "-43. 






EF093-2 


! : • . Gout i " ;o .."-■a ^ ■ '-'-97. 







EF094-2 


lie a . ^out r - ~G abou" -\sn-316. 






EF095-2 


;Yr :-. .. rout m: 3> tr . m ut \' ai-330, from arr a 'i:t Asp- 
}>• . ;:omab a: r-.a- ; " io : '■ nil Gly-639. :a : 'o about 






EF096-2 


lie:;; .bout i • : - ^ a -:.G30\ from. :a - .:L Asp- 
ic), 






EF097-2 


i;v . mout '\ G- ; r~ 10 ..'vr. ■. ; iv-359. 







EF099-2 


from -bout (;ia-— to about Asp-47 } from abo: Gly- 
: • • om a' .a A-ii-JNn to ;mout Asp-289. 







EF101-2 


ima; :bout i.\>--'m tr .moat / -m-42, from a 1 ^ ' Asn- 
: - ;> . Horn a 1 i -m-2;ss :o -.-rut Glv-290 






EF 102-2 


•:a a; about .--..-m- ; ! - : m mom ^sp-316. 






EF103-2 


from .ibout Asimm io about ^lv-48. 






EF 104-2 


iiv.it .ibout 1:<-1A' to about mrs-237, from ar-. " ait Asn- 
36 . aom ar rut Ser-42 : to about Gly-423. fror. N -~o about 
Ser-4v0, fro: v. abort AsmrO<a to about Asnofl' -■ -'G7 to 
abi rt Lys-6 a ma, about Aap-727 to about C at Asn- 
"m ir- about Sn^r4. 1mm about Lys-771 to a'- . -m. about 
lle-mb to ar ml As;i-S?~. m :r. about Pro-A^ '. a 






EF105-2 


from about Scr--n to about Cbv-43 } from abom. r Gln-97, 
lV;: rr about i iin-22n to \bout Gly-222, from am a-out Glv- 

i o 






EF106-2 


trom /.bout . -.sp-' A to about v ' : y-75, from abo; ■.•■'it Asp- 
2"". !; om ab ;ut A or r A) to amut Arg-313. 






EF107-2 


in. m ::bout m • A ~ a about \sn-l 57, from A' : -out A ~ t-_ 
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Table 4. Residues Vmnrisr : Auimemc Epiiope-Bearing Pom. 



EF108-2 



EF 110-2 



EF111-2 



EF 119-2 



EF121-2 



EF122-2 



EF123-2 



EF 124-2 



19 1. !:om about A:l>27() to about Gly-272, fiv 
Lw33j, from about Asp-365 to about Asp-?* 1 
abo ut Asp-4: 7 , i'rom about Giv-485 to about '1 



from about 1 \s-142 to about Trp-145, from a ; ~ 
];A.. from abom /ulz-212 to about Gly-214. fro; 
Ayp-251, from about Asp-3S4 to about Asp-3S 
about Arg-4K3. from about Lys-491 to about G 
619 to about 0>ivAA4. from about Asp-656 to : 
l.y..-~ ] 7 to about A so.- '2 I. from about Scr-822 
a h ( u 1 1 r Tyr-l 1 3 7 to about l'lir- i 1 4 1 . 



from about iuo-123 to about Giy-127, from aA 



from about Lvs-207 to about Asn-209, from no 
a.su-248, from about Lvs-39o to about AspAAA 
about Scr-432. from about 4411-470 to about I lu 



from about Asp-9t) to about Asn-92, from abo 



from about y , sn- i 5 A to abom Asp-161, from ; 
Lys-353, from about Pro-65S to about Gly-66 
about "Scr-78A __ 



rom about A sn- 1 59 to about Asp-161, from a I. 
Lys-353, from about ProAoS to about Gly-660. 
about Ser-78A 



from about /-.sn-331 to about Arg-336, from ab 
63o. i'rom about Glu-780 to about Ser-782, from 
Asu-9 1 1, from about Lvs-939 to about Glu-942. 
about Gly-b ; >. from about Asp-1367 to about 
Pro- 1433 to about LvsA43:A from about GlvA 
from about i.ys-!656 to about Asp-1660. from 
Glml863, fiom about Scr-1916 to about Gin- 19 
to about Gh A 942. 



from about Asn-33 1 to about Arg-336, from a: 
63r, from about GIu-SO to about Ser-782, fro> 
As; 'A 1, from about Lys-939 to about Glu-9A 
about Gly-1 ' "6. i'rom about Asp-1367 to abru: 
pr,,- 1U33 to about Pvs-1435. from about Gly- ' 



- o about 
!-o-45 1 to 



4 out Tyr- 
• to about 
! -481 to 
00 Thr- 
• : om about 
A 1 , from 



"Hit Glv- 



"OUt 

-429 to 



;t Gly- 



oOUt 

-^86 to 



about 
s-786 to 



bout Gly- 
! o about 
-1074 to 
about 
-1518, 
o about 
•o-i 940 



aoout Gly- 
;o about 
• 1074 to 
m a! bout 
oa-1518. 
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Table 4. Residues i .■: .oris: 



-Bearing 





Ao: . a-out , ^ - . . u . .-''.sp-1660. :r a, 
\ hi A A, 1; ;. . a: : : 1 . : to about G'.nA 
t ) a oat Gb . -■ - 






EF125-2 


l : oi . a* -out • • -A :.- ,000a, re-336. iroo, 
(v6. fiam a'' oa. ■ b..-~- a at Ser- 7 S2. :; 
A st v A A jr. ,.i io -A— about Go. 4 
a. bo 0 ily- ) a i ^p-I 367 to a bo a 
1 ' ro • ;~33 to o\o.t 1.ys-!4.Aa :rom about Giy- . 
l:or , aAout 1 ;. <- A ^ a :Ao:o Asp- 1660. fro:: 
Ah A -A. f; ::. Ao a >::-A J ! - to about Gb.- ' 
to a-o'A Gb. - '. 






EF 126-2 


Aora. about : a: A3" to aAoo ' yr-23 9. fro: v. a. 'a 
352. from a u a. ■. --4 ! a to about Asn-4 ] S. :V. 
Asp-44S, fi\ oo. aoout Asa-4a v ' to about Lys-4a . 
aboat Asp-5 A ; . from about Gaa-639 to about ■ 






EF127-2 


from about baoA A- ; u: ; bout ': or-239. from ab, 
352. from ab. io. io--- : 1 5 io a^out Asn-4 IS. b , 
Asy -44$, fn ::: .uoob Aoi-4S^ l .o about Lys~A 
aboat Asp-5 . v from afoot Go. -639 to about . 






EF128-2 


from about >ar-A ; ^ to ; boo: Avr-239. front aA 
35A from a 'a .a. A s-4 ! 5 to .oout Asn-4 IS. fro 
Asp-448, fro::-, aba-::: A^::-4So to about Lvs-4^ 






EF129-2 


:ror . aoout 4-::Aou m about Gly-302, from ab 
3 19. from about A sn-3b5 to aoout His-387 






EF131-2 


from about b\ sA.O! to about yr-204, from ab 
266 






EF132-2 


fro::: about AurA^ to about A::-2S. 



aDOllt 
A 940 



ait Giv- 
a bo Lit 
- : 074 to 

' v Ut 

A518. 
about 
•A940 



■at Gly- 
o abotit 
A o to 



on Gly- 
.o about 
' 1 6 to 



-at Gly- 
o about 
5 1 6 to 



at Glv- 



ut Ser- 
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What Is ^Liimcd i 

1. An iso. mm nuclei, ... v; p. prising a p< ; : cieotme 
sequence mlccecl fro: ' c ope pg ot: 

(a') a nuclc mm sequee. c wo \::ng wv we of the wow ' e 

polypeptides wow n i: . w ,, or 

(b) a nude ,>twe >equee .. . . w.pler. eppe ; 10 anyone-' • . wes in (a), 

(c ianuclc 41,:^ sequepw w. .cm: 0 : . wcmical to any ■ >w wquences 
shown in Tabic 1 ; or. 

(d) a nucleotide sequepw wust9ew pwptical to a m;m- w.piementary 
to any one of the nuci.- -■ wpie::. w shown in Table 1 

2. An iso...te : i:::cic;. p leco . w . .aim 1 com: : o. . ■ which 

hybridizes un .ier strirw cw :\\ nridiyamv, conditions to ^ p cm: a 

nucleotide sequence i;::::;;.-;: :o a :pw- woe sequence m 

3. An isolated miclek ... ... pwleculc o; "claim 1 comprm:.. which 

encodes a:: e;:wpe-b. ... : piop o: c polypeptide in w 

4. The isoiatwi nucleo ;.co: molecule o:" claim 3, where:' c wg portion 
of a polypeptide comw.w w; amino mm sequence Imce 

5. A method tor making recombmam \ ector conipn.' ::: . wed nucleic 
acid molecule of claim 1 :i:'o a vec,.':'. 

6. A recombmam vew w produced by the method of ckep 

7. A host cell comprising me vector of claim 6. 

8. A method of proelec;: w polypeptide comprising: 

(a) growing the host ceil o; waim ~ mch that the protein : :c cell; and 

(b) recoveiin . the expe^o: polypeptide 

9. An isolate:: polypep: we rompiwwig e polypeptide seiw op 
consisting of 

(a) a polypeptide cowwmw of one of me complete amne ■". Table 1; 

(b) a polypeptide conowow of one me complete amino aw Table 1 except 
the N-termmal residue. 
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(c) a fragment of the p ] e "ede o! 

(d) a fragment of the y Y\ \" °' 
polypeptide of (a). 



having biologic; 
w hich binds to 



10. An isolated antibo; 

11. A polypeptide pro. 



.v.fic for the polypeptide of c'. 
according to the method of c: 



12. An isolated polypeptide comprising an amino acid >.eq. 
a sequence selected from me group consisting of an amm< 
the polypeptides in Tabic \ 

13. An isolate:: polype- m.c antigen comprising an amino . 
faecalis epitope shown m > aole 4. 

14. An isolated nucleic aem molecule comprising a polvm: 
sequence encoding a polypeptide o! eiaan 9. 

15. A hybndoma which pi ounces a:- antibody of claim b 



; or the 



identical to 
any one of 



"an E. 



uclcotide 



16. A vac e inc. comprising: 

(1) one or more E. fa, . m •• polypeptides selected from m 
polypeptide of claim 9: an : 

(2) apharmaceutically acceptable diluent, carrier, or cxcip. 
wherein said polypeptide is present, m an amount eflccPo. 
antibodies in an animal to a member of the Enierococats g 

17. A method of preventing or attenuating an infection can 
Enterococcus genus in an animal, comprising administer:; 
of claim 9. wherein said, p emeptide is administered in an 
or attenuate said infection 



of a 



r of the 

a polypeptide 

• to prevent 



18. A method of detemi 



icroc 



act us riucleic acids ir 



comprising: 

(a) contacting the sample with one or more nucleic acids 
such that hybridization occurs, and 

(b) detecting hybridization of said nucleic acids to the or 
nucleic acid sequences prevent in the biological sample. 



conditions 



x:cus 
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19. A me: v : :e:e. ' 
from an a:.;nd,i. eomr; : 
(a) ampli iae o::e o: : . 
polymera- . ci.air. rea^d 
(bj detec .g - aid a:;aa 



v ni:e;eic acid: 
us micleic add 
-us nucleic aca 



Viained 



:e using 



20. A kit : r detecting 
animal, o d.ir: ising 

(a) a poh peptide of cl 

(b) delecidig means. 



uibodies in a ' 



a solid supp 



from an 



21. A mo ,d. oi" detect: 
from an aama;l. compri.* 

(a) conta. did: the samp 

(b) detecdae antibody-. 



ddid i . :ts antibodies id 

: a jd v peptide of claim 
a cor.ddexes. 
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